System and method for detecting, collecting, analyzing, and communicating event-related information

ABSTRACT

A system and method involves detecting operational social disruptive events on a global scale, assigning disease or non-disease event staging and warnings to express data in more simplistic terms, modeling data in conjunction with linguistics analysis to establish responsive actions, generating visualization and modeling capabilities for communicating information, and modeling disease or non-disease propagation for containment and forecasting purposes.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of Ser. No. 13/090,742 filed Apr. 20, 2011, which claims priority to U.S. Provisional Patent Application Ser. No. 61/325,940, filed Apr. 20, 2010, and is a continuation-in-part of U.S. patent application Ser. No. 12/629,706, filed Dec. 2, 2009 (now U.S. Pat. No. 8,881,040), which is a continuation-in-part of U.S. patent application Ser. No. 12/230,397, filed Aug. 28, 2008 (now U.S. Pat. No. 9,489,495), which is related to and claims priority to U.S. Provisional Patent Application Ser. No. 61/064,256, filed Feb. 25, 2008, 61/046,275, filed Apr. 18, 2008, and 61/077,713, filed Jul. 2, 2008, the contents of all of which are incorporated herein in their entirety.

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT

This invention is related to federally sponsored research and development under ITIC contract number 2006-1016 426-000, TATRC contract numbers W81XWH-04-1-0857 and DAMD17-94-V-4015, NLM contract number N01-LM-3-3306, DC DOH contract number PO-HC-2004-P-1545, and OSC contract number 2008-1176516-000. The invention was made with U.S. government support. The U.S. government has certain rights in the invention.

BACKGROUND OF THE INVENTION Field of the Invention

The present invention involves detecting and tracking global events, such as communicable disease outbreaks, and collecting, analyzing, and reporting information relating to those events to decision makers and the public using various communications modes.

Description of the Related Art

Malaria, diphtheria, cholera, smallpox, influenza, and plague, among many other diseases, have had devastating consequences throughout human history. The first recorded pandemic of plague is thought to have begun in Egypt around 540 AD and to have swept through Europe, Africa, and parts of Asia, killing an estimated 50 percent of the population in those areas. A second pandemic, the Black Death of 1346 to 1352, killed an estimated 20 to 30 million people in Europe, or approximately one-third of the continent's inhabitants. Echoes of that pandemic reverberated for centuries, resulting in major political, cultural, and religious upheaval. A third pandemic, which is ongoing, originated in China in the late nineteenth century and brought plague to the United States early in the twentieth century. By 1718, this third pandemic had claimed an estimated 12 million lives in India alone.

At the root of those pandemics was Yersinia pestis, the bacterium that causes three different kinds of plague: bubonic, characterized by bubos, often in the groin or axillae; pneumonic, characterized by involvement of the lungs; and septicemic, characterized by bacteremia. All forms of the disease have high mortality rates when left untreated. Moreover, the disease can be spread not only by insect vectors (e.g., fleas that have fed on infected rodents), but also, in the pneumonic form, from person to person via infectious aerosols and droplets. Human migration and travel spread the disease over long distances.

While plague is more virulent than many other infectious diseases, its infamy in terms of geographic coverage and numbers infected is hardly atypical. Epidemic and pandemic measles, tuberculosis, and cholera have exacted staggering death tolls. Near the end of World War I in 1718, a vicious strain of influenza A virus emerged and spread rapidly throughout the world. The 1718 strain—called the “Spanish flu,” although it is unclear where the strain originated—was unusual in that it tended to cause death in young and healthy adults, as opposed to young children and the old and infirm. Spreading efficiently through droplet transmission, the upheaval of the war probably made global transmission more efficient. The disease first spread easily in close military quarters and was then translocated throughout the world as troops returned to their native countries. The pandemic, lasting about a year, killed an estimated 20 to 50 million people, and sickened another 300 million. In the United States, 300,000 or more people are believed to have died of the 1718 flu. By comparison, in 2003, an estimated 3 million people died from human immunodeficiency virus (HIV) infection globally, of which about 20,000 of those deaths occurred in the United States, and an estimated 35 to 40 million globally were living with the infection. Thus, the 1718 flu counts among the most devastating events in human history.

More recently, a previously unknown agent, severe acute respiratory syndrome (SARS) coronavirus, emerged from Asia and spread rapidly throughout the Pacific Rim and beyond. Between November 2002 and July 2003, more than 8,000 people worldwide developed SARS, of which nearly 800 died. The SARS virus is spread primarily by droplet via person-to-person contact, although aerosol transmission may also play a role. The SARS outbreak was particularly problematic in Southeast Asia, as well as in Canada, where more than 200 cases were recorded in multiple provinces. In late July 2003, the World Health Organization (WHO) declared the global outbreak to be over, although laboratory-associated infections have since occurred. While SARS is thought to have been eliminated from the human population, its significant case fatality rate (approximately 10 percent), and its ease of spread, warrant continued global surveillance.

New infectious agents continue to emerge. Over the past 30 years, dozens of previously unknown agents have been discovered. The nature of these newly emergent pathogens is varied, and in many cases is incompletely understood. Moreover, known pathogens can mutate to create new strains to which the human immune system is naïve or ill-adapted, and which do not respond to existing vaccines and pharmacological treatments. This scenario is common among influenza viruses, which undergo reassortment at random intervals, potentially producing pandemic strains capable of causing devastating disease.

For example, after the devastation of 1718, influenza pandemics occurred in 1957 and 1968, although they were less lethal than their 1718 predecessor. Today the world awaits the next such pandemic event. At present, there is heightening concern that a recently emergent avian influenza virus, A/H5N1, may have the potential to ignite the next influenza pandemic. Regardless of what happens with avian flu, the question is when, not if, the world will next experience pandemic influenza. Whether the next pandemic will unfold into a catastrophic event, akin to that of 1718, will depend on a variety of molecular, social, economic, medical, and public health factors.

On the darker side of human endeavor is the specter of biological warfare and bioterrorism, in which dangerous communicable agents can be spread either intentionally to human populations or agriculture. Although the Cold War superpowers did not engage in biological warfare, accidental infections have occurred among workers involved in weapons programs. Similarly, at least one catastrophic accidental release of a bio-warfare agent has been documented. Accidental infections with common pathogens have also occurred in research as well as public health laboratories. If terrorists, who almost certainly lack sophisticated laboratory infrastructure and advanced containment protocols, pursue bio-weapons, the world will likely be at risk for additional accidental releases, to say nothing of intentional attacks.

In addition to morbidity and mortality, the widespread propagation of disease can have devastating social and economic impacts. For example, the U.S. General Accounting Office (now the Government Accountability Office) reported that the SARS crisis “temporarily dampened consumer confidence in Asia, costing Asian economies $11 billion to $18 billion and resulting in estimated losses of 0.5 percent to 2 percent of total output, according to official and academic estimates. SARS had significant, but temporary, negative impacts on a variety of economic activities, especially travel and tourism. Moreover, in the modern world of international investment, economic losses in one region can affect performance in distant regions. For example, a number of U.S. companies suspect that they experienced quarterly losses in 2003 because of SARS.

The tremendous impact of the HIV/Acquired Immune Deficiency Syndrome (AIDS) pandemic on African economies has been widely studied. Certainly, the impact has been complex; however, the pandemic has clearly tended to reduce labor supply and productivity while increasing the need for imports. Economic models suggest that HIV/AIDS will continue to have serious, long-term economic consequences throughout sub-Saharan Africa. These consequences pose a challenge to all nations, which must deal with reduced foreign markets and the increasing need for humanitarian outreach and international aid.

Emergent plant and animal diseases also affect economic and social institutions. In 2001, during the outbreak of foot and mouth disease in the United Kingdom, agricultural producers suffered estimated losses of £355 million, or about 20 percent of the region's estimated total income from farming that year. Moreover, businesses directly affected by tourist expenditures lost an estimated £2.7-£3.2 billion as fewer people visited the countryside. Overall, the outbreak is estimated to have reduced the United Kingdom's gross domestic product by approximately 0.2 percent in 2001.

A variety of socio-economic factors encourage the spread of infection. Predating the Spanish flu pandemic by nearly three decades, the influenza pandemic of 1890 demonstrated that even by that the late 19th century, steamship and railroad travel had made the rapid, global propagation of pathogens possible. Since then, influenza pandemics have reached farther and wider as a result of advances in travel technology such as air transport, and on account of the global proliferation of traffic nodes.

The world is growing more crowded and more interconnected. Should any pathogen, such as influenza A/H5N1, emerge as a major threat to human health, it could spread rapidly throughout the globe. In 1718, the global population was approximately 1.8 billion; today it has surpassed 6 billion, and it is projected to be near 9 billion by 2050. This population increase has resulted in the phenomenon of megacities (cities with populations greater than 10 million). In 1960 there were two such cities; by 2000 the number had risen to 20 worldwide. The number of such cities projected for 2015 is 26, with 22 of them in developing nations and 18 in Asia alone.

Such high populations are likely to dramatically increase the opportunity for close-contact transmission of many pathogens, including diarrheal, respiratory, and foodborne agents, and to pose unique problems for rapid containment. This concern is particularly relevant for megacities in the poorest areas of the world, where problems of malnutrition and diminished immune status may be expected to grow. It can be expected that large populations will surpass the threshold numbers required to sustain as-yet undiscovered diseases. Measles, for example, did not become endemic until the size of human settlements surpassed about 250,000 inhabitants. It is not unreasonable to expect that newly emergent pathogens will gain footholds and thrive in our increasingly densely populated world.

The proliferation of air travel is also likely to facilitate the transmission of new pathogens. In 1990, 81 million passengers and 7.7 billion pounds of freight were exchanged between the United States and the rest of the world by air. By 2004, global air traffic had increased to 138 million passengers and more than 16 billion pounds of freight. In 1718 there was essentially no international air travel; today more than 18 million commercial flights serve virtually all areas of the world. It is currently possible to circumnavigate the globe in 36 hours via regularly scheduled commercial flights.

These statistics highlight two crucial epidemiological trends over the last few decades: (1) people are traveling more, and (2) travel times are dramatically shorter. Thus, larger numbers of individuals are traveling to areas where exotic and emerging pathogens circulate. Similarly, more people, living species, and agricultural commodities are crossing borders than ever before, increasing the likelihood that diseases previously confined to areas where populations have developed some immunity to the responsible pathogens will be introduced into immunologically naive populations. Moreover, once a new pathogen appears in any given area, it is increasingly likely to be spread globally via air travel within a matter of days. Under these combined circumstances, nations are likely to confront new infectious diseases as their citizens travel abroad and return. The effects of such easy disease transmission on international commerce could be enormous.

To preserve human health and economic well-being in this new context of dense populations and frequent air travel, there must be effective global surveillance of infectious diseases. Only through such surveillance will it be possible to prevent outbreaks from spreading throughout the world. The payoff from early warnings regarding infectious disease is well documented. Cases accumulate exponentially in the early phases of an epidemic until some degree of population immunity is achieved. At this point, new cases appear more slowly and eventually cease. If an epidemic is identified early, and effective control measures such as quarantine, immunization, and travel/trade embargos are instituted, the appearance of new cases slows, and ultimately ceases as the epidemic is brought under control. In such a scenario, morbidity and mortality are significantly lower than in the case with no early warning and no control. Under the best-case scenario, one with even earlier warning and action, action is taken earlier and the epidemic is brought under control shortly thereafter. Note that, because of the exponential character of epidemics, the tiniest bit of action taken at an early stage can dramatically alter the course of the disease; thus, illustrating the importance of early warning.

One international effort to facilitate early warning of dangerous infectious disease globally involves the International Health Regulations (IHRs; revised in May 2005). The IHRs have their origins in the cholera epidemics that affected Europe between 1830 and 1847. They are designed to “prevent, protect against, control and provide a public health response to the international spread of disease.” The regulations require member nations to notify WHO within 24 hours of finding a single case of any of the diseases on WHO's watch list, □ and to report other public health emergencies of “international concern” within their jurisdictions. Furthermore, the regulations set forth measures for deratting, disinfecting, and disinsecting international conveyances (such as ships and aircraft) at points of arrival and departure. Under the IHRs, nations are obligated to build “national capacity for routine preventive measures as well as to detect and respond to public health emergencies of international concern. These routine measures include public health actions at ports, airports, land borders and for means of transport that use them to travel internationally.” Thus the IHRs embody the public health maxim that “an ounce of prevention is worth a pound of cure.” However, not all nations have the capacity to detect such diseases within their own borders. Moreover, in some instances, governments may suppress information relating to outbreaks that could lead to adverse economic consequences (such as embargoes on potentially infected products, or reduced tourism).

At the international level, there are both official and unofficial mechanisms for international disease notification. The primary means of official notification resides with WHO's Global Alert and Response Network (GOARN), a global network of WHO member states, United Nations organizations, and partner nongovernmental organizations that respond to the needs of member states in crisis due to epidemic disease. Standardized protocols for network structure, operations, and communications have been agreed upon in an effort to improve coordination among the GOARN partners. WHO provides institutional support for the network (such as employment of project managers and support for the steering committee) and coordinates international outbreak response using network resources.

One of the services GOARN maintains is the Canadian-based Global Public Health Intelligence Network (GPHIN), which scans global media articles via the Internet for references to disease outbreaks and epidemics. In May 2003, partly in response to the SARS crisis, WHO began formally utilizing the information sources searched by GPHIN, in conjunction with government reports, to issue global disease alerts.

Unofficial notification occurs via many means, such as the Program of Monitoring Emerging Diseases (ProMED), a moderated Internet mailing list that tracks emerging infections of humans, plants, and animals. Currently a program of the International Society for Infectious Diseases (ISID) and, therefore, not bound by multilateral agreements, ProMED does not require an official mandate to post and disseminate information. Rather, it relies on volunteers from around the world to submit information about infectious diseases and related issues. Much, though not all, of this information consists of media articles. Submitted material is vetted by a group of experts so that only the most relevant and credible information is displayed.

The GPHIN utilizes the international connectivity of the Internet to help detect disease. However, at present, it is limited by the volume and type of media material that can be processed in multiple languages. Additional limitations affect WHO's ability to detect and assess a rapidly spreading epidemic, including the lack of adequate public health infrastructure in member nations, the potential involvement of a previously uncharacterized pathogen (as in the case of SARS), ongoing conflict that inhibits the flow of information out of nations, and the willingness of states to report the extent of an outbreak.

Likewise, while ProMED and similar networks are valuable complements to official sources, their volunteer nature limits the number of staff available to process the volume of submitted reports. Similarly, such networks tend to have limited resources to support language translation issues. In concert, these limitations compromise the ability of volunteer networks to provide comprehensive early warning of potential outbreaks.

The limitations described above highlight differences between raw medical surveillance data and the translation of those data into actionable threat information. During the initial stages of an outbreak, very little may be known about an event. As time progresses, this uncertainty typically decreases, though it may do so slowly. On the other hand, the probability of translocation to other areas tends to increase with time. The key public health question is, when should action be taken to reduce risk? There are many choices. The time at which preventive actions are taken will be a function of medical as well as social, political, and economic factors. Nonetheless, actionable information acts to decrease uncertainty and can provide a strong rationale for early measures to minimize the likelihood of translocation.

For example, the VEE epidemic represents a possible translocation issue for the U.S. given air traffic from Maracaibo, Venezuela, connected directly to Miami (with unknown connector flights to other destinations within the U.S.) that seasonally peaked during the month of containment loss. To-date, it is unknown whether it would have been possible that VEE could translocate to Miami, trigger an outbreak that progressed to an epidemic, ecological establishment, and repeated seasonal transmission thereafter for years to come. A comprehensive assessment of the transmission competency of endemic mosquito species in Miami would be necessary to determine if this was a valid hazard concern.

In 1979, a laboratory accident involving aerosolized anthrax occurred in Sverdlovsk, Russia. From Apr. 14-May 18, 1979, local media in Sverdlovsk explicitly reported the occurrence of a series of human cases of inhalation anthrax along with draconian countermeasures as officials sought to rapidly contain and conceal the true etiology of the event. In 1992 and 1993, a team of American and Russian researchers led by Meselson and colleagues traveled to Sverdlovsk to investigate evidence for two hypotheses of the anthrax epidemic of 1979, the official USSR version that infected meat caused the outbreak and the US intelligence claims that the true etiology of the epidemic was an accidental release of aerosolized anthrax spores from the Compound 19 within the Voyenny Gorodok 47 biological weapons laboratory located in the city. The Meselson team concluded that an accidental aerosol release had indeed occurred on Apr. 2, 1979, resulting in what is thought to be the largest documented outbreak of human inhalation anthrax in history. Declassified U.S. intelligence archives suggest the intelligence community was unaware of this event until months after the fact.

This example highlights the requirement for a tactical approach to detect biological events and to baseline not only the epidemiological data for the disease itself, but social responses as well. Identifying “unusual” biological events that are evolving rapidly, with an attendant recurrence, elevation, and diversification of the indications and warnings, may assist in a time-sensitive evaluation of whether there may be a question of attribution. Of particular note, through retrospective and prospective studies, it has been known that an indicator of intentional release is not often the first apparent tip to an analyst; rather, it is the identification of an “unusual” biological event that is later found to be related to suspicious conditions.

In each of those examples, the biological event in question produced a “ripple effect” whereby indications and warnings (I&Ws) appeared in media. The analyst is in a struggle against time to put pieces together to tell enough of a coherent story to alert the user community to awareness. To properly capture events and triage advisories appropriately, a blended I&Ws reporting requirement list that incorporates an integrated disease list has been developed.

At a workshop sponsored by the U.S. Army in 2004, entitled “The Role of Indications and Warnings for Prediction and Surveillance of Catastrophic Biological Events,” a panel of researchers concluded that decision making could be strengthened by using a graded threat awareness scale that corresponds to varying levels of confidence in the potential for a given disease to spread. However, neither the newly revised WHO Pandemic Phases nor the Canadian or U.S. governments' current concepts of how to respond to pandemic influenza provide specific guidance for cross-matching actionable threat information to response decisions. Such a scale could support a graded response to imperfectly known threats.

Researchers have recently investigated the use of local-level syndromic surveillance to anticipate reports of clinically confirmed disease. The basic idea is to target the early manifestations of disease as they are presented to health care workers. Whether syndromic surveillance is likely to detect an epidemic sooner than the usual methods (such as reporting by alert clinicians) is unclear. Nonetheless, the underlying concept of discerning peri-event signatures that are indicative of disease transmission may prove useful in initiating more traditional epidemiological investigations at earlier points in the life of a disease.

Panelists at the 2004 I&W workshop concluded that there may be observable social, economic, and political signatures of the early stages of large-scale biological events. If such signatures could be used to cue confirmatory epidemiological investigations, it might be possible to identify and contain outbreaks days or weeks before current approaches are able to do so. Thus, these I&Ws may have the potential to identify an epidemic in its early stages, thereby allowing for more effective control and prevention of its regional or international spread. Conference attendees published their proceedings in 2004, which included postulating use of the Internet to gather documents containing I&Ws for event detection purposes.

No single approach can achieve that goal. Rather, a combination of approaches is necessary to reduce risk and support decision making that is meant to mitigate the translocation of biological threats. It is posited that at least two approaches can be combined to create a global I&W system that meets the goals of early detection. The first is environmental surveillance for ecological markers of conditions supporting disease emergence. In many areas of the world, environmental conditions are known to play a key role in the emergence of certain infectious agents, especially vectorborne pathogens. Satellite sensors can monitor environmental conditions (such as vegetation health, rainfall, and standing water) over large areas, and the risk of disease emergence can be assessed based on those observations. Static or seasonal maps of areas that are ecologically suitable for disease emergence or transmission, coupled with maps of transportation routes for different disease hosts (such as humans and livestock) and vectors, may prove useful in the short term. Longer-term, more sophisticated systems that collect and update such data, then make it available for analysis, are needed.

The second approach is to monitor the Internet for markers of “social disruption” that indicate anomalous infectious activity. Both adaptive and maladaptive social responses to disease epidemics have been studied previously. McGrath (1991) examined the historical ethnographies of 229 ethnic groups from North and South America, Europe, Africa, the Middle East, Russia, and Oceania, and recorded six basic social responses to epidemics: mass evacuation or flight from the site of the epidemic, extraordinary therapeutic or preventive measures, scapegoating of individuals or institutions, acceptance of disease, ostracism of the ill, and conflict. Of these, flight was found to be the most common. McGrath noted that the acute, dramatic appearance of disease, particularly unfamiliar disease, with high rates of morbidity and mortality provoked the strongest social response. McGrath observed that if established indigenous countermeasures failed, innovative countermeasures were undertaken, followed by social disruption that began with flight, then proceeded to acceptance of disease, to rejection of authorities, and ultimately to conflict. This sequence, McGrath argued, depicts a society that has degraded from normal functional status to disintegration. It is inferred that social disruption may be characterized as an emergent property triggered by a biological threat that defies countermeasures, control, and containment.

The near-global coverage of the Internet has led to highly fluid formal and informal reporting of local and regional news and events relating to social disruptions. Such reports contain information about the collective behavior of communities, which in turn can provide implicit or explicit information on the health of peoples. That approach promises to become even more comprehensive as the Internet continues to expand its reach. The use of the Internet for that purpose was described in a NASA open source reader in 1998.

Both approaches—monitoring the environment for ecological markers of conditions supporting disease emergence, and monitoring the Internet for markers of “social disruption” that indicate anomalous infectious activity—are supported by advances in information technology. For example, multisource data mining, machine learning, and language translation algorithms are now available to expand upon the GPHIN approach of exploiting published media reports. Similarly, remote sensing of many types of relevant environmental conditions is now possible at low or no cost over the Internet. Moreover, computer modeling and simulation can be used for analysis. For example, the Soviet Union developed computer models that successfully tracked the nationwide spread of influenza. Later, the approach was used retrospectively to analyze the 1968-69 pandemic of influenza A/H3N2. Recent research suggests that such a tool, together with up-to-date data on transportation and human movement, could be used to examine optimal control strategies early in an epidemic. Transportation network analysis, an active area of research, is necessary for understanding the relationship between global connectivity and infectious disease threats.

A global I&W system represents only one tool in the armamentarium of ways to address the threats posed by infectious disease. Knowing that a potentially threatening biological event is under way somewhere in the world is a necessary condition for mitigating its effects, but it is not sufficient. The social, economic, and political context of the threat will determine how it can be controlled. It may be useful to think of infectious disease threats in the same way we think of meteorological threats. In the United States, the National Weather Service operates a warning system for severe weather. It is generally acknowledged that this system is important and worth maintaining and improving, but knowing that severe weather is threatening a particular area at a particular time is not sufficient to save lives; people must use the information to that end. Such is the case with infectious disease, as well.

Nonetheless, early warnings of infectious disease threats remain a pillar of public health and national biodefense decision making, especially in the face of current world trends in transportation, commerce, and population. A global I&W system would be broadly applicable to natural, accidental, and terrorist-induced outbreaks of human and agricultural diseases. Any such tool would facilitate more informed decision making and thus would help mitigate the international spread of infectious disease.

It is necessary to have a balance between rapid identification and interdiction of state and non-state sponsored biological weapons programs, and recognition that a Spanish flu of 1718 scenario may severely incapacitate a country's infrastructure, to include its national defenses. It has traditionally been the view held by some that pandemic influenza is a public health, not intelligence concern. The creation of Homeland Security Presidential Directives-7, -9, and -10; National Security Presidential Directive-33; and now Public Law 110-53 have attempted to resolve this debate and place intelligence within the context of a national biosurveillance integration strategy for biological defense. The operational reality is that a biological event effecting humans or animals may translocate from any place in the world to any other place before adequate ground truth and attribution assessment can occur due to the substantial global transportation and commerce grid.

Topical and geographic focus, as has been observed in agencies such as the U.S. Centers for Disease Control and Prevention (CDC) and the U.S. Department of Agriculture (USDA), has been typically stove-piped within specialty programs that do not readily share information with other programs. A global picture of situational awareness that spans all attribution concern has not been available either at the intra- or inter-departmental level sufficient to operate in the new global operations environment that demands a rapid event detection-to-ground truth cycle. Determination of attribution is often a time-sensitive forensic process, as evidenced by the activities of Aum Shynrikyo, the Rajnessh cult, Japan Unit 731, South Africa's Project Coast, and the immense former Soviet Union biological weapons program. Coupling near-real time I&Ws to traditional epidemiological approaches and intelligence is a critical need in today's global community.

The intelligence community faces several issues that may present themselves similarly in an I&W detection and tracking environment:

-   -   Naturally occurring biological events of high potential         consequence to the U.S. and its allies (e.g., pandemic         influenza);     -   Accident of biotechnology producing a biological event of high         potential consequence to the U.S. and its allies (e.g.,         accidental reintroduction of pandemic influenza virus that has         not been in global circulation for decades such as the         appearance of H1N1 in 1977);     -   Accident of state or non-state sponsored biological weapons         program resulting in a biological event of high potential         consequence to the U.S. and its allies (e.g., hypothetical         laboratory accident involving terrorists that releases a         respiratory virus that transmits efficiently in the surrounding         community);     -   Intentional test release of a pathogen by a state or non-state         sponsored biological weapons program resulting in a biological         event of high potential consequence to the U.S. and its allies         (e.g., terrorist group performs a limited field test with a         respiratory virus that transmits efficiently in the surrounding         community); or     -   Intentional deployment of a biological weapon by a state or         non-state sponsored program (e.g., terrorists deliberately         release plague in an urban area).

Other issues of importance include rapid identification and monitoring of suspected state or non-state programs that may be associated with named individuals or institutions.

There are different time differentials, sensitivities, and specificities associated with the spectrum of products involving rapid processing of I&Ws versus a finished assessment.

I&Ws have long been used in intelligence. In World War II, one effective indicator of the success of Allied bombing raids on the French railroads was the price of oranges in Paris: successful raids disrupted transportation of the fruits (from southern France to the capital city) sufficiently to cause a transient price spike. In biological surveillance, one family of indirect indicators is based on observations of other, “sentinel” species that serve as alternative hosts for an infectious agent or are exposed to the same pathogen to a relatively greater extent. Historically, a plague epidemic was heralded by the appearance of numerous dead rodents: the fleas that spread the disease among them would now move to another source of blood (human hosts). Human methyl-mercury poisoning in Japan (Minamata disease) was preceded by neurological symptoms in cats that ate contaminated fish, and bird deaths are monitored during surveillance for West Nile fever. Other indicators are more remote, in that the causal links are less obvious given the state of existing knowledge. Edward Jenner's work on vaccination was triggered by his exploration of the old wives' tale that milkmaids rarely got smallpox. It turned out that, after they were infected with cowpox from the animals they milked, they acquired immunity to the smallpox pathogen.

Today's researchers, unlike their counterparts of the past, have near real-time access to vast amounts of different types of data. The technologies grouped under the rubric “data mining” enable researchers to plow through data to generate a vast number of hypotheses in a prioritized fashion. Offset against that advantage are the following caveats:

-   -   Few data mining techniques are associated with tests of         statistical significance: a suggested hypothesis does not         usually come with a “p-value.”     -   An observed relationship between an indirect indicator and the         pathogen does not, in the absence of additional knowledge, imply         cause or effect. Further, because of the problem of confounding         variables, the factors that may be predictors may turn out not         to be so. That problem has been known since the devising of the         Pearson correlation coefficient in the early 20th century. Two         variables that were apparently highly correlated were found to         be associated only because of a third unconsidered variable. The         spurious association reported by Huff between the salaries of         Presbyterian ministers in Massachusetts and the price of rum in         Havana (with the common variable being inflation) is an example         of this problem. Thus, indirect indicators may not always         reflect cause-and-effect relationships.     -   Given finite resources, the number of hypotheses that one would         like to explore is always far greater than the number of         hypotheses that one can explore.

Previous investigators have attempted to define event evolution as a function of media reporting. Cieri and colleagues (1802) proposed that an event be defined as “a specific thing that happens at a specific time and place along with all necessary pre-conditions and unavoidable consequences.” Makkonen (2003) observed that a seminal event can lead to various related events and outcomes, and the initial cause of these events may become less obvious over time. Chin and colleagues proposed that a media-reported event can be considered “a life form with stages of birth, growth, decay, and death,” and that the maintenance of the reported event is dependent on sensationalism. Those analyses, however, did not identify variable properties of media reports that are potentially useful as indirect measures of the intensification of I&Ws over time: recurrence, elevation, and diversification. By these measures, biological events may be reported as increasingly complex phenomena over time, whose “nourishment” is dependent on whether the biological agent in question is perceived by the local community to remain an active threat of concern.

What is needed is a system and method for facilitating disease reporting that covers regions of the world where there is poor or nonexistent disease reporting infrastructure. What is also needed is a system and method that provides situational awareness of biological events, especially epidemics of infectious disease in regions where such information is suppressed, and newly-emergent pathogen events where the pathogen has unknown characteristics. What is also needed is a system and method to detect and respond to a pandemic that is transparent, facilitates scientific cooperation, and allows for rapid reporting of biological outbreaks in birds and humans.

In the summer of 2004, an allocation of funds from the Intelligence Technology Innovation Center (ITIC) and the Department of Homeland Security (DHS) was made available to support research and development of a completely novel approach in foreign biosurveillance: Project Argus (now Global Argus), which is a real-world embodiment of the presently disclosed invention.

Though biosurveillance has been a concern, others have been concerned with situational awareness of non-biological events. The UN Office for the Coordination of Humanitarian Affairs, Reuters Foundation, International Crisis Group, International Relations and Security Network, Virtual Research Associates (VRA), and European Media Monitor are examples of organizations that coordinate the collection and dissemination of information about events unfolding around the world or in particular regions or counties. Numerous theoretic models for early warning have been used, including the Minorities at Risk Project, PIOOM, and the State Failure Project. Conflict prevention scholars have performed comparative analyses of the typology, methodologies and effectiveness of these models.

Previous methodologies are disadvantaged in that they do not update hourly or even daily in some cases, do not combine rigorous baseline methodology with advanced searching and reporting criteria, do not use thousands of sources and multi-language search capability, do not utility regional, cultural, and language experts as filters to interpret data, do not use a tested taxonomy that categorizes events and provides insight into the significance and magnitude of an event, do not use multiple layers of event analysis and description, and do not use interactive visualization tools.

SUMMARY OF THE INVENTION

Biosurveillance requires a synthesis of analytic approaches derived from the natural disaster, intelligence, public health, epidemiological, medical, veterinary, agricultural, meteorological, anthropological, and sociological communities, among others. With the exception of the GPHIN and WHO, at the time of conception of the present invention, a formalized professional discipline in operational biosurveillance with research, education, and training support did not exist in the U.S. Accordingly, a principal object of the present invention is to provide such an operational biosurveillance capability. In particular, it is an object of the present invention to provide a global biological event detection and tracking capability that provides early warnings of biological events, and estimations of the probabilities of such events escalating. To reach that object, the present invention relies on monitoring social disruption through native language reports in electronic local sources around the globe, because local societies are highly sensitive to perceived emergence of biological threats, and the resulting conditions and responses are readily identifiable through a granular review of local sources of information using the present system and method.

Without limiting the invention, the term “social disruption” is defined generally as a severe alteration in the normal functioning of a social system that, under extreme conditions, can lead to the collapse of social infrastructure. Social infrastructure is the range of activities, organizations, and facilities supporting the social relationship in a community. Extreme social disruption exacerbates the effects of biological events.

In the present invention, prior sociological and anthropological descriptions of social disruption have been expanded by applying the concept to the detection and assessment of biological threats. I&Ws (or markers) of social disruption, used in tandem with explicit reports of disease, can identify an emerging biological event and indicate its level of containment. Without limiting the invention, the term “containment,” as used herein, is defined to mean the process by which an indigenous public health and medical infrastructure undertakes countermeasures aimed at controlling an outbreak or epidemic. Three general conditions favor the use of the social disruption approach: inadequate support for local disease reporting, suppression of information about health crises, and/or the emergence of a previously unrecognized pathogen or of a well-known pathogen that displays dramatically different transmission properties than previously known or suspected.

Biosurveillance involves significant human analysis of information. Thus, it is another object of the present invention to use manual and automated computerized techniques for collecting electronic information relating to social disruption, by looking for specific I&Ws, and then analyzing the collected information. The information collection, information processing, and analysis subsystems of the present invention are based on the isisMiTAP (ISIS Center/MITRE Text and Audio Processing) platform originally developed under the Defense Advanced Research Projects Agency's Translingual Information Detection Extraction and Summarization Program (DARPA-TIDES). The isisMiTAP offered an online news service and information retrieval system that captured and categorized information according to user-defined, searchable newsgroups. Using isisMiTAP, media sources could be iteratively searched using multiple keyword combinations. The searches included only articles published in English or translated into English by regional news sources. To extend the range of data sources, additional searches with isisMiTAP using Chinese-, Russian-, and Korean-language local news sources and regional business and financial reports were conducted, sometimes with the aid of a bilingual human translator. From that platform, the present invention as described herein evolved.

It is another object of the present invention to use grounded sociological theory to develop a set of I&Ws of social disruption illustrating the dynamic properties of each type of social response over time. I&Ws include reports of human or animal disease, demand for medical services, local perception of threat, official acknowledgment, official action, and integrity of infrastructure. The I&Ws represent important ways in which an emerging epidemic might disrupt the everyday social life of individuals, communities, and institutions; therefore they might appear in local, regional, or national media sources. The expanded capture of media reports from local and regional sources provides crucial information because such sources reflect local knowledge of typical and atypical phenomena.

It has been hypothesized that selected I&Ws would display cross-cultural reliability. For example, it was expected that evidence of local perception of threat wherever and whenever individuals initially become aware of an emerging epidemic would be found. Without limiting the invention, the term “parameter” is used herein to refer to manifestations of a marker appearing in a specific cultural or historical context. It was expected that parameters for most markers would display not only cross-cultural reliability, but also cross-cultural variation depending on the impact of the disease and local conceptions of appropriate response. The parameters for some markers, such as official acknowledgment, may display less cross-cultural variation than others, a point for exploration in future analyses.

It is another object of the present invention to monitor the changes in I&Ws over time. Three variable properties of media reports have been identified—recurrence, elevation, and diversification—that may serve as indirect measures of the intensification of I&Ws over time. Without limiting the invention, “recurrence” is used herein to refer to additional reports of a phenomenon already described at least once in the media. Media reports may also describe increases in a phenomenon, indicating “elevation” of its manifestation. Without limiting the invention, “diversification” as used herein to refers to the emergence of multiple parameters of a single marker. As media reports repeatedly describe elevated levels of increasingly diverse parameters of particular I&Ws, a picture emerges of an uncontained, intensifying biological threat with increasing potential for international translocation.

Two broad categories of I&Ws—direct and indirect—are consistent across the cultures, diseases, and situations that have been examined. Direct I&Ws, such as reports of a disease outbreak, derive from medical and veterinary information and refer specifically to a biological event. Indirect I&Ws encompass qualitative information regarding social disruption and containment status, such as reports suggesting that individuals and organizations have begun changing their daily routine activities in response to a perceived biological event.

It was hypothesized that when analyzed as a whole, media-based I&Ws may reveal a biological event evolving in geotemporal fashion. Remarkable similarity in social disruption parameters across biological events that involved different pathogens affecting humans has been observed. This similarity has also been seen across multiple countries and time periods. The initial findings also suggested a similarity between the social disruption caused by pathogens that affect humans and those that affect animals. It is recognized that more research was required to define the temporal, pathogenic, and cultural differences that existed at various levels of the sociological model.

It is another object of the present invention to use manual and automated computerized techniques for identifying and collecting temporally dynamic social disruption evidence and ranking or defining the evidence by degrees, classifications, or categories.

It is still another object of the present invention to employ a heuristic staging model to characterize an event. Previous scales, including the Saffir-Simpson hurricane, Fujita tornado, and Ricther earthquake scales, are unsuited to the present invention. The present invention uses a heuristic scale model (previously referred to as the Wilson-Collmann Scale), based on the I&Ws of social disruption described above, which is better suited than current methods to detect and assess the dramatic appearance of diseases, particularly unfamiliar diseases, with high rates of morbidity and mortality. Such evolving biological events provoke the strongest social response.

The present heuristic staging model is intended to serve as a guide for analysis and as an interdisciplinary standard operational language. In daily real-world analysis, it would be unlikely to observe a logical flow of information representing the systematic progression of a biological event and associated social responses. As suggested earlier, gaps in information are likely because of inadequate local disease reporting, information suppression, the emergence of a previously unrecognized pathogen, or the emergence of a well-known pathogen that displays dramatically different transmission properties. As with the storm severity scales mentioned above, the present model estimates the impact of a biological event on society, and thus can be used to prompt response decisions about mitigative measures and anticipatory consequence management. Such decisions may range from simple requests for more information to quarantine, closure of country borders, and international trade restrictions. Just as most storms are given low stage designations, the majority of biological events that have been evaluated merited low stage assessments, reflecting the relatively low probability of social collapse induced by biological events.

It is another object of the present invention to provide I&W profiles using a sample size that is appropriate for gauging social disruption induced by various diseases over time. It is believed that there is some variation in the social disruption caused by biological events not only across diseases, but also across countries. That variation is likely due to differences in social tolerance for disruption, the sophistication of public health and medical infrastructures, and the direct effects of the disease in question. It was experientially observed that biological events taking place in countries with limited infrastructure were likely to progress to higher stages more quickly than those occurring in countries with more robust infrastructure. One disease in particular—influenza-like illness—was observed to consistently defy countermeasures across multiple countries with a wide range of medical sophistication. This finding was troubling, particularly in light of current concern about pandemic influenza.

It was hypothesized that, after an appropriate sample size was examined over time, each disease will exhibit a characteristic level of social disruption. If a disease that typically exhibits focal transmission characteristics was found to generate higher levels of social disruption, this information may prompt closer scrutiny of the event for unusual epidemiological features and possible implications regarding attribution. Stages 1 to 4 apply across disease and location, whereas Stage 0 appeared to apply for specific diseases in particular locations. Stage R refers to specific recovery aspects of observed events.

It is still another object of the invention to provide a staging model that allows for upgraded and downgraded staging on a dynamic basis. With the present model, biological event-related social disruption can be readily described as it escalates in severity. However, the dynamic progression of a biological event may follow a pattern of upgraded and downgraded staging to reflect successful countermeasures or particular disease transmission characteristics, among other factors. Future investigations would be necessary to address the difficult question of how social disruption is resolved and normalcy is reestablished in a society. Furthermore, observations of media behavior suggested that reporting became more vigorous and descriptive as an event evolved and generated maximal impact (and sensational interest in media reporting). However, the media tended to report resolution of an event less vigorously, particularly in closed societies. This trend continues to pose a challenge to the analyst with regard to downgrading the staging of an event.

It is noted that the real-world application of the present staging model is likely be complicated by uncertainty. It is anticipated that media reporting of some biological events at any stage may provide only indirect I&Ws, just a single report, or social disruption parameters not included in the model. It is recognized that some biological events may not follow the order of social disruption embodied in the present model. Nevertheless, it is another object of the invention to use categories of recurrence, elevation, and diversification, along with proper contextualization of I&Ws, to allow for more precise categorization of a biological event. Such a staging model would likely be revised as more analyses were conducted across time, cultures, and disease entities. It is recognized that ongoing operational use of this methodology may yield longitudinal data that could be used in forecasting models.

Another object of the invention is to facilitate linking progressive warnings generated in Stages 1 through 4 with prompt, appropriately coordinated response decisions by response officials or end users. Difficult response decisions may have to be made using limited I&W information when no diagnostic information regarding the disease in question is available. If a response must await the receipt of diagnostic information, the resulting delay may mean the difference between mitigation and having to manage consequences after the fact. With regard to the current threat of pandemic influenza, the United States would be prompted into action once WHO has declared that a pandemic is occurring. It has been shown that the system can identify I&Ws that are often present in indigenous local media weeks to months in advance of WHO's public declaration of an international public health threat. Such I&Ws, considered within a model such as the one disclosed here, may enable earlier threat assessment that can prompt scrutiny for an event of potentially global concern.

It is still another object of the present invention to provide, in addition to the staging model previously described, a higher level of assessment of events. After testing an existing scale (in particular, the Wilson-Collmann Scale) on several biological events throughout the Pacific Rim, it was noticed that staging a biological event was useful for event evolution assessment and baselining local community social disruption, but was not as useful for triaging the volume of reports captured. The reason for this is, while detecting social collapse is important, the user community would want to be notified of an event and maintain awareness from the early piece of information to an event's ultimate outcome. Therefore, a higher level of assessment, akin to the National Weather Service's Warning, Watch, Advisory approach to storm tracking, was identified as a need and is used in the present invention.

One of the key research questions throughout the development of the present system has been, and continues to be, “what is actionable”? It was found, experientially, that if several members of a biosurveillance operations team at CDC, USDA, DHS, or DOD were posed this question, the response would involve different answers. It reflected no prior experience by these analysts with a codified operations approach to prioritize reporting requirements and understand how triaged information may drive response actions by their respective organizations.

The National Weather Service advisory system model was seen as the most relevant and appropriate one to use, however implementation was in a vacuum of federal guidance. Those involved in the present system initially tested various approaches of this model based primarily on social disruption, then later factoring other parameters as described later.

Still another object of the present invention includes providing a system and method for early biological event detection with high sensitivity of event tracking, which includes monitoring countermeasure efficacy and issuing actionable advisories.

It is another object of the present invention to detect agents that can compromise/collapse an Intensive Care Unit and therefore collapse an urban hospital and prompt a city-wide “medical blackout.”

It is still another object of the present invention to monitor and assess an event site in terms of its connectivity to the U.S. by air flights and commerce trade.

It is another object of the present invention to facilitate coupling the detection and forecasting capabilities of the present system with the collection of ground truth evidence by others, such as end users. Biological event evolution is a process where the analyst is typically not responding to specific, explicit declarations of an outbreak of an expertly diagnosed disease as the first tip of information to start an event case file. The first piece of information may simply be a report of bird die-offs; the etiology of the die-offs may involve dozens of possibilities. If the analyst notes panic buying of herbal medicines in the same province, that may or may not be related to the report of bird die-offs. The analyst struggles throughout the process with uncertainty, and over time this uncertainty is resolved in the flow of local and multi-lingual reporting. Ultimately, it is hoped this information is clarified with independent, trusted third party ground truth.

It is still another object of the present invention to identify evidence indicating that containment of an event has been lost.

Another object of the present invention is to provide the capability to monitor thousands of validated sources providing coverage in all recognized countries or regions around the globe in multiple languages within an online information harvesting engine (including languages such as, but not limited to, English, Spanish, Portuguese, Italian, French, Japanese, Korean, Farsi, Dari, Arabic, Urdu, Indu, Georgian, Chinese (Mandarin and Cantonese), Mongolian, Czech, Russian, Macedonian, Serbo-Croat, Bulgarian, Slovak, Bahasa Indonesian, Malay, Tajik, Uzbek, Kyrgyz, Kazak, Hindi, Greek, Vietnamese, Thai, Romanian, German, Hebrew, and Turkish).

It is still another object of the present invention to provide a subsystem for verification of information. The inherent nature of biological event I&Ws means that more often than not, there is a significant degree of uncertainty until ground verification has been obtained. Time is critical, particularly when considering the discovery of a novel influenza virus such as H5N1, and whether or not it may be efficiently transmitting between humans. Ground verification must occur rapidly enough to enable decision making for rapid global response. This has direct implications for decisions regarding sharing of information with state and local authorities.

Another object of the present invention is to provide a system for communicating to end-users specific or summary local- and country-level reports along with assigned stratified social disruption stage and/or alert levels, which may be used by others for developing actionable decisions.

It is another object of the present invention to communicate information about biological events for biodefense purposes. Biodefense is the application of government and private resources to protect the nation against biological threats, including natural, intentional, and unintentional threats. Biodefense is important for at least the following reasons: (1) disease outbreaks are not tied to geographic or political boundaries; (2) bioevents may cause catastrophic harm; (3) real-time early detection may have a significant difference in outcomes; and (4) there is a perception that the U.S. does not have sufficient capabilities to provide early detection and respond with an adequate response to bioevents.

Another object of the present invention is to provide a scalable system that can be adapted to adding additional servers and interface programs to accommodate increasing amounts of documents collected from information sources, as well as to accommodate more analysts running more and more queries, and to allow increasing numbers of end users/customers with access to the system.

Other objects of the present invention include disseminating information to end users, providing a multi-lingual search engine, providing machine and manual translation support, providing an input to allow users to annotate collected articles, providing a routine to categorize collected articles, providing an integration platform that ties the above features together, and to allow for special data fees and collection methods on an ad hoc basis.

Still another object of the present invention is to apply the biosurveillance capabilities of the system for use by government, corporate, insurance, financial, commodities, and investment entities, as well as to provide situational awareness of the public and private markets in which those entities operate.

Briefly described, those and other objects and features of the present invention are accomplished, as embodied and fully described herein, by a method for communicating event information, which includes the steps of outputting over a communications network a first instruction to a client computer to display a graphic on a graphical user interface device, the graphic including at least a planar representation of a portion of the Earth's surface, and retrieving information stored in a first database concerning a first event, wherein the first event is identified from at least one parameter associated with a plurality of indications and warnings, and wherein the information comprises a first scale value describing a severity of the first event and a geographical location describing a location of the first event. The method also includes the steps of selecting from a plurality of indicia a first indicia representing the scale value of the first event, the first indicia including a link to additional information concerning the first event, providing to a memory of the client computer the additional information for display on the graphical user interface upon activation of the link; and outputting a second instruction to the client computer to display the first indicia on the displayed graphic at the geographical location if the geographical location is within the displayed graphic portion of the Earth's surface.

The objects and features of the present invention are also accomplished, as embodied and fully described herein, by a system a system for communicating event information, the system including an event server in data communications with a plurality of client computers used to monitor a plurality of pre-determined events; a software subsystem for outputting a first instruction from the server to display a graphic on a graphical user interface device of each of the plurality of client computers, the graphic including at least a planar representation of a portion of the Earth's surface; a first database in data communication with the event server comprising a plurality of records, each record comprising information about one of the plurality of pre-determined events, wherein each of the pre-determined events is identified from at least one parameter associated with a plurality of indications and warnings and wherein the information comprises a scale value describing a severity of the one of the plurality of pre-determined events and a geographical location describing a location of the one of the plurality of pre-determined events; and a plurality of indicia for use in representing the scale value, each of the plurality of indicia including a link for displaying on the graphic additional information concerning the events upon activation of the link; wherein the plurality of indicia are displayed on the graphic at the geographical location if the geographical location is within the displayed graphic portion of the Earth's surface.

The analytic subsystem involves, on the human side, regional teams of analysts divided among regions of the world to conduct and report on primary open-source analysis of avian influenza or other events, a Supervisory Analyst, if needed, for each region to supervise and assure quality of regional analysts' work, and a Chief Analyst, if needed, to oversee all collection and analysis activities. A Chief Analytic Cell (CAC), if needed, composed of the Chief Analyst and two Deputy Chiefs, oversees the overall analytical quality and issues daily situational awareness reports based on reports from the regional analytic teams. The regions may include, but are not limited to, Southeast Asia, East Asia, Central Asia/Caucuses, Middle East, Eastern Europe/Baltic States, Oceania, Western Europe, Africa and Central/South America.

Regional analysts collect global open source information in accordance with I&Ws training and current reporting requirements. They analyze raw media articles, monitor and analyze outputs from Bayesian models as well as evaluate and dispose of case file summaries. Senior Analysts provide quality assurance and guidance in collection, analysis and reporting of biological events of interest. In addition, Senior Analysts and CAC create and post stratified country alerts. Other methods of administering the human side of the system are also possible and would be equally effective as the embodiment described above.

Preferably, teams of from three to six analysts focus on each of nine global regions outside of the United States, each supervised by senior and chief analysts. These teams of analysts are tasked with using the technologies described below to review local, globally distributed reports and summarize relevant indications and warnings. Their summary reports are forwarded to senior analysts and are stored into a database to be subjected to mathematical modeling and utilized in longitudinal studies. The senior analysts bring to bear their expertise in medicine, public health, epidemiology, climate patterns and other fields and develop daily national assessments from the raw data gathered by the regional analysts. Findings are also checked against established baseline data. This represents a fundamental shift in global biosurveillance: these analysts are not only experts in identifying trends in disease but also in social behaviors associated with these events at the local level. This in-depth understanding enables the analysts to recognize unusual conditions prior to diagnostic confirmation of the biological agent involved.

In one embodiment of the invention, analysts begin their day with Morning Report, where all of the analysts gather to share critical indicators captured overnight or over the weekend. This is a synchronization point for the day, where the participants employ “swarm awareness” for the top tactical issues that everyone needs awareness. The reason for this approach is while an analyst may be responsible for a given region reporting an event of concern, it is more often the rule that the event will be reported by multiple languages from sources in multiple countries with economic, political, or general interest in the region of concern. Often this additional reporting brings valuable information to consider.

The system and method may access over a million pieces of information daily covering every country in the world, and may generate 200 reports per day. Through the system, country- or regional-level Warnings, Watches, and Advisories may be issued in accordance with guidelines agreed upon by research partners in the federal government. There may be multiple Advisories, Watches, and Warnings active on the information processing and analysis subsystem at any given time, with thousands of individual case files of socially disruptive biological events being maintained and monitored daily in over 170 countries (for example) involving 130 disease entities affecting humans or animals (for example). The system maximum load is approximately 3,300 individual case files that may be maintained and monitored, which no previous global biosurveillance system could achieve. Advisories may be shared with user communities through the reporting Watchboard reporting module described herein.

The analytic aspect of the invention leverages an online harvesting engine to move through a codified triage of information to information dissemination within the user community. Raw data is captured and processed. This is the first filter pass and is based on keyword clusters and Bayesian information retrieval models. The second filter step is country level junior analysts who speak the relevant native vernacular reviewing system outputs (and primary sources if it is a language not covered by the system, which covers 18 languages, whereas linguists cover 34 languages; this reflects industry limitations in machine translation). The junior analysts review a set of reporting requirements that are a combination of social disruption I&Ws and a prioritized disease list. This pass results in the creation of an event case file. The event case file is forwarded to the regional senior analyst who checks the report for quality control and then assesses the information from the perspective of the country advisory level reporting requirements. Thus, the user community sees country level advisories but also has the option of drilling into detailed case files with their associated update reports.

Because the system accesses information at a much greater granularity and volume than previously done in global biosurveillance, it may be used to report any and all socially disruptive biological events that are being monitored that may affect humans or other animals. This provides the foundation of a species-agnostic approach to biosurveillance that satisfies the tenets in HSPD-7, -9, -10; NSPD-33, and PL 100-53.

Still another object of the present invention is to use open source information to support the identification, tracking, and early warning within a compressed time frame of outbreaks of emerging threats. I&Ws associated with emerging threats are identified and tracked in media sources through the recognition of destabilizing events, which may involve, infrastructure, economics, food security, weapons of mass destruction and proliferation, cyber-security, improvised explosive devices (LEDs), human trafficking, and narcotics. The near real-time surveillance of open source local news media combined with a relevant taxonomy and a tested coding system provides an analyst with a robust data set of trends and patterns.

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWING

Those and other objects, advantages, and features of the invention, as well as the invention itself, will become more readily apparent from the following detailed description when read together with the following drawings, in which:

FIG. 1 is schematic showing an operational overview of the present invention;

FIG. 2 is a schematic drawing of the basic components of the overall system according to the present invention;

FIG. 3A is a schematic diagram showing the general workflow of information from collection to output according to the present invention;

FIG. 3B is a continuation of the schematic workflow diagram of FIG. 3A, and shows the general workflow of information through the information collection and processing subsystem;

FIG. 3C is a schematic block diagram of the basic database architecture of the present system;

FIG. 3D is continuation of the schematic workflow diagram of FIG. 3B, and shows the general workflow of information through the information analysis and reporting subsystem;

FIG. 3E is continuation of the schematic workflow diagram of FIG. 3D, and shows the general workflow of information through the information communications subsystem;

FIG. 4 is a schematic block diagram of the basic architecture of certain aspects of the present system;

FIG. 5 is an information and workflow diagram of one aspect of the present invention;

FIGS. 6A and 6B are information and workflow diagrams of another aspect of the present invention;

FIGS. 7A and 7B information and workflow diagrams for the connector node B shown in FIGS. 6A and 6B;

FIG. 8 is an information and workflow diagram for the connector node C shown in FIGS. 6A and 6B;

FIG. 9 is a user interface template for setting up a scheduled search according to the present invention;

FIGS. 10A, 10B, and 10C are information and workflow diagrams of another aspect of the present invention;

FIG. 11 is a diagram of one feature of the user interface of the present invention;

FIG. 12 is a diagram of another feature of the user interface of the present invention;

FIG. 13 is a diagram of one stored document in the article repository database according to the present invention;

FIG. 14 is a diagram of yet another feature of the user interface of the present invention;

FIG. 15 is a diagram of the basic reporting template of the present invention;

FIG. 16 is a diagram of another feature of the basic reporting template of the present invention;

FIG. 17 is a diagram of yet another feature of the basic reporting template of the present invention;

FIG. 18 is a diagram of still another feature of the basic reporting template of the present invention;

FIG. 19 is a diagram of a template for reviewing and adding new information sources to be monitored by an analyst;

FIG. 20 is a diagram of an analyst worklist que according to the present invention;

FIG. 21 is a diagram showing the event report reviewing template according to the present invention;

FIG. 22 is a diagram showing yet another event reporting template according to the present invention;

FIG. 23 is a diagram showing another view of the event reporting template of FIG. 21 according to the present invention;

FIG. 24 is an information and workflow diagram of the back-end information communications subsystem;

FIG. 25 is a screen-shot drawing showing a list of country- or regional-level event reports grouped by alert levels;

FIG. 26 is an example of a country-level event report for Algeria;

FIG. 27 is a schematic drawing showing a color-coded alert map;

FIG. 28 is a schematic drawing showing a screen shot of a GIS system displaying data from the present invention;

FIG. 29 is a timeline drawing displaying influenza-related event information for China and other countries;

FIGS. 30A and 30B are drawings showing charts containing summary and statistical data from the present invention;

FIG. 31 is a drawing of an exemplary summary chart showing time series traces of the number of I&Ws detected in the data obtained from various information sources according to the present invention;

FIG. 32 is an information and workflow diagram of another aspect of the present invention;

FIG. 33 is a block diagram of a the components and subsystems of a graphical user interface for the invention described in FIG. 32;

FIG. 34 is a diagram of a window or form for creating a new event report according to one aspect of the present invention;

FIG. 35 is a diagram of a window or form for creating a new article according to one aspect of the present invention; and

FIG. 36 is a diagram of a window or form for creating a new action according to one aspect of the present invention.

DETAILED DESCRIPTION OF THE INVENTION

Several preferred embodiments of the invention are described for illustrative purposes, it being understood that the invention may be embodied in other forms not specifically shown in any drawings submitted herewith or described below. The system and method of the present invention are illustrated using techniques for biosurveillance of biological events involving humans, animals, and plants; however, the invention is equally useful for other kinds of events that have social disruption potential and that can be detected using various I&Ws contained in open source (or other) documents. For example, the invention may be useful for, among other things, detecting and monitoring political, economic, industrial, and environmental events, as well as civil unrest such as dislocation, riots, violence against property or people; public opinion about any topic of interest including but not limited to political; public policy; religious or entertainment; natural disaster cause and aftermath; natural resource exploitation; and military activity.

I. System Architecture and Work Flow

Turning first to FIG. 1, shown therein is an operational overview of the present invention, which involves an information collection center 102, a communications infrastructure 106, and a plurality of information sources 104 around the world. The information collection center 102 may be a single facility within or outside the U.S., or multiple facilities scattered across or outside the U.S. operating together or independently and each operatively connected to each other via one or more communications networks (not shown).

The information collection center 102 receives and examines a continuous stream of information and/or data being generated over a communications infrastructure 106, which, as illustrated in FIG. 1, is represented by individual communications links between the information collection center 102 and the information sources 104. For purposes of this detailed description, the information and data are generally news articles in the form of web document files, such as XML, HTML, ASP, or other compatible file types (see discussion below concerning potentially incompatible file types). Essentially, any open source document, listserve, thread, email, database, etc., is a potential information sources 104.

The communications infrastructure 106 includes a communications network, such as a packet- or circuit-switched network, that is capable of transmitting information and data of any kind. The Internet is the preferred communications network for the present invention.

The information sources 104 shown in FIG. 1 are identified by reference to individual cities, countries, and/or regions where the data originate. There is no geographic or other restriction on where information sources 104 may be located, or where the information and data published or provided by those information sources 104 originate (the actual information and data may originate at the site of the information source 104, or remote from the information source 104). Although FIG. 1 suggests that information sources 104 are located at land surfaces, it is also possible that information sources 104 may be associated with aircraft and spacecraft platforms, as well as submarine platforms. Information sources 104 may be fixed or mobile. The information sources 104 may also be identified by reference to the source or type of information, such as news articles, web portals, really simple syndication (RSS) feeds, and blogs, to name a few.

To illustrate the relationship between the information sources 104 and the origin of information and data, assume the information and data originate at a hospital in Asia that is treating individuals that live proximate to the hospital. Reports of increased hospital visits are broadcast on a website published by a news reporting service in the same city as the hospital. The website is hosted by an Internet Service Provider (ISP) with web servers located in a city 100 miles from the city where the hospital is located. Under that scenario, the information source 104 is the news reporting service website (or, more accurately, the web server that stores the actual website files containing the reported information), although the origin of the information and data is the hospital.

The information and data in news articles or other formats are captured primarily from web sites, as described above, and then formatted into a common encoding representation (typically extensible markup language (XML) or other files), indexed for rapid query access, and stored in an article repository database, as described below.

Turning now to FIG. 2, shown therein is a schematic drawing of the basic components of the overall system 200, which includes an information collection and processing subsystem 202, an information analysis and reporting subsystem 204, and an information communications subsystem 206. The inputs to the system 200 are the information and data described above, which are pulled from information sources 104, as well as inputs from analysts that interface with the system 200 (described below). The outputs from the system 200 are various formatted reports communicating event-related information to end users, or simply raw or processed event-related information and data.

FIGS. 3A through 3E will now be described. The figures are schematic diagrams showing the general workflow of information from collection to output. Turning first to FIG. 3A, shown therein are exemplary information sources 104 connected to communications infrastructure 106 (the Internet, in this case). The information sources 104 are shown as being sources of web documents 300, which may be any file capable of being transmitted in packets (i.e., in the case of using the Internet Protocol or other packet-switched network to transmit files). Web documents 300 could include hypertext markup language (HTML), XML, portable document format (PDF), text, image, audio, and other file types. A web crawling module (not shown) contains instructions to crawl each of the information sources 104 on a regular schedule to identify those file types. The frequency of the crawl is generally every approximately four hours, but the frequency could be increased to every hour or some other faster frequency if warranted by a particular situation. Those skilled in the art will appreciate that the information sources 104 may include not only web documents 300, but also files created from radio frequency signals intercepted through the air, sea, and space platforms that are translated, transcripted, and/or encoded.

FIG. 3B is a continuation of the schematic workflow diagram of FIG. 3A and shows the general workflow of information through the information collection and processing subsystem 202. The subsystem 202 includes one or more document collection servers 302, one or more document processing servers 304, a “RAMBO” alert generation and modeling module 306, and a “Newsstand” document search and reporting interface module 308, which includes a language processing module for foreign language translation (described below).

The document collection servers 302 (also referred to as information source servers), store the data obtained from the information sources 104 after the web crawler identifies new information and data at the information sources 104 and the information and data are downloaded.

The document processing servers 304 perform several functions on the information and data after it has been downloaded from the information sources 104. One function is to translate the title of any documents that are downloaded (the body of the documents are not translated, but on-demand translation is available). Additional functions of the system 200 components will be described in more detail below.

FIG. 3C is a schematic block diagram of the basic database architecture of the system 200. The top block shown on the left represents the aforementioned “RAMBO” alert generation and modeling module 306 and the “Newsstand” document search and reporting interface module 308. The bottom block on the left represents the aforementioned document processing servers 304. The block on the right shows three databases: the article repository database 310, the searchable document/article indexes database 312, and the MySQL relational database management system (RDBMS) database 314. The document search and reporting interface module 308 (i.e., Newsstand) is a Tomcat-based JSP/servlet web application running on a Linux platform using the document index 312 and MySQL database 314.

The article repository database 310 contains the downloaded web document files from the information sources 104, and it maintains records for each document collected from the information sources 104 (or that are generated elsewhere, i.e., a document generated from two or more separate downloaded web documents). As described previously, the actual documents are encoded into a common file format, such as an XML file, but various formats are also possible. Each record includes a document identification, the title, published data, and the “body” of the document. In order to avoid duplicates, as each information source 104 is crawled, the available URLs for downloading are compared to the URL records in a URL database associated with the article repository database 310 to check for duplicates.

The searchable document/article indexes database 312 contains an index of words. Lucene is used for this purpose because it provides, through an inverted index scheme, relatively fast search results. Each document or data stored in the article repository database 310 has one or more keywords of interest. Thus, by identifying keywords, one or more documents having those keywords may be located using the indexes. Currently, the indexes in the articles index database 312 point to over 132 million document files, of which more than about 36 million are stored as English-language documents, with the rest being, in order, Chinese-, Russian-, and then Spanish-language documents. The documents may be in one of 18 languages, or a combination of languages.

The SQL databases contain records of end users, saved searches (described later), the uniform resource locator (URL) associated with the information sources 104, and other records.

Overall, the system 200 involves using about 20 document collection servers 302, document processing servers 304, various web servers (not shown), and other computing devices of various levels of performance running Solaris, Linux, and Windows OS. Java servlets, which are Java classes, are used. The user interface is generated using a combination of HTML, Java server pages, and Java servlets. The Linux operating system runs the MySQL relational database 314. The Tomcat application is used for userid/password authentication (user information is stored in the database 314). Java Database Connectivity (JDBC) is used to communicate with MySQL. The Linux file system is used as the document repository. Apache Lucene is the search engine, but is limited to dealing with text.

FIG. 3D is continuation of the schematic workflow diagram of FIG. 3B, and shows the general workflow of information through the information analysis and reporting subsystem 204. This subsystem involves using the “Watchboard” search and reporting module 316 by one or more analysts 318, senior analysts 320, and chief analyst 322 (this administrative allocation is for illustrative purposes only). The functions of each of the components of the system 200 will be described with reference to the process flow diagrams below. The search and reporting module 316 (Watchboard) is an ASP.NET web application running on IIS Windows Server running the MS SQL database 314.

FIG. 3E is a continuation of the schematic workflow diagram of FIG. 3D, and shows the general workflow of information through the information communications subsystem 206. This subsystem involves end users 324 accessing the system 200 over communications infrastructure 106, such as the Internet, to download web documents, analyst's event reports, maps, geographical information system (GIS) data, and raw or processed event-related information and data.

FIG. 4 is a schematic block diagram of the basic architecture of certain aspects of the system 200. In particular, certain components of the document processing servers 304 and the “Newsstand” document search and reporting interface module 308 are shown.

The document processing servers 304 provide an enhanced information processing infrastructure with a parallelized architecture for document capturing, formatting, and indexing. It also provides a centralized mechanism for managing the document or article repository database 310 and the data access services. As described above, the document processing servers 304 include a document source index 402, which is an index of the various information sources 104 (generally indexed to their respective URLs), and a document index 404, which is an index relating keywords to document identification numbers for index keyword searching purposes.

The document source user interface 406 allows an analyst 318 (or senior analysts 320 or a chief analyst 322) to access the document source index 402. The analyst 318 runs searches using the document search user interface 408. A list of search hits is returned from the document indexes 404 to the search results user interface 410. The analyst 318 may retrieve the document corresponding to an item in the results list from the document collection servers 310. If document translation is available for the language of the document, the user can translate the document using the machine translation gateway subsystem.

II. System Operation

The methods of using the system 200 will now be described in detail, beginning with reference to FIG. 5, which is an information and workflow diagram of one aspect of the present invention. Generally speaking, the system 200 collects global open source information and data in accordance with specific I&W training and current reporting requirements. The system is not entirely automated, but requires the expertise of human research analysts who analyze raw media articles, monitor and analyze outputs from Bayesian models as well as evaluate and dispose of case file summaries. More senior analysts provide quality assurance and guidance in collection, analysis, and reporting of events of interest. Still more senior analysts, or chief analysts, create and post stratified country alerts.

Step 502. The overall data collection effort according to the present invention is driven by a comprehensive targeting analysis, which stems from identifying a biosurveillance mission at the outset. The mission could be, for example, to monitor avian H5N1 influenza in Southeast Asia. Tables 1A and 1B below show many of the types of diseases and other events for which the system 200 is or possibly could be used for surveillance purposes.

TABLE 1A Disease Events Priority 1 Priority 2 Priority 3 Priority 4 Pandemic Botulism Abrin Atrovirus Influenze/Avian Bovine Spongiform African horse Campylobacter influenza Encephalopathy sickness Clostridium Respiratory Disease (Mad Cow disease) African swine fever perfringens Anthrax CCHF Akabane Diphtheria Ebola Creutzfeldt-Jakob Brucellosis E. coli 0157:H7 Marburg disease Chikungunya Filariasis Plague Flexal virus Cholera Gastroenteritis Rift Valley Fever Foot-and-Mouth Contagious bovine Giardia SARS disease pleuropneumonia Hand, foot mouth Smallpox Guananrito Contagious caprine disease Undiagnosed/Unknown (Venezuelan pleuropneumonia Hepatitis Disease Hemorrhagic Fever) Dengue Legionellosis VEE Hog Cholera EEE Malaria XDR-TB Junin (Argentine Far Eastern tick- Measles Hemorrhagic Fever) borne rickettsiosis Meniningococcal Kuru Glanders menigitis Lassa Fever Hantavirus Norovirus Machupo (Bolivian Hendra Pertussis Hemorrhagic Fever) Japanese Rabies Newcastle disease encephalitis Ross River virus Porcine Respiratory Meliodosis Rotavirus & Reproductive Nipah Salmonella Syndrome (PRRS) Psittacosis Sapovirus Polio Q fever Shigellosis Pox Viruses (except Rickettsia (Dysentery) smallpox) prowazekii Streptococcus Ricin Rickettsia rickettsii Tetanus Rinderpest Swine vesicular West Nile virus Sabia (Brazillian disease Hermorrhagic Fever) Vesicular stomatitis Tularemia WEE Yellow fever

Other Events Priority 1 Priority 2 Priority 3 Priority 4 Any HS3 event Complex mutli- Earthquake (5.0 or >) Cyclone/Hurricane/ Any HS4 event pathogen event Typhoon Biological weapon use Diagnostic confusion Food poisoning Drought Bird die-off Flooding (Trigger Industrial accident Volcano First report of Zone) (chemical) - to Pete Wild Fires pathogen in new Tsunami Industrial Accident region (Nuclear) - to Pete Healthcare prof./vets Poison anxiety or concern Laboratory accident Public panic due to bio event Ventilator depletion Vaccine accident Vaccine or therapeutic failure or compromised

Step 504. The targeting analysis enables identification of the key required Critical Information Requirements (CIRs) and what information sources 104 can provide such information in a timely and credible manner. The CIRs are typically a list of five to ten (preferably five) statements that define the key items that an end user 324 is primarily concerned about. An example of a CIR is any credible evidence of an act of intentional biological agent release.

Step 506. The targeting analysis also refers to defining the data collection requirements and implied social networks and information providers needed to obtain the data. This is accomplished by identifying I&Ws. I&Ws fall into two categories: direct and indirect. Direct I&Ws refer to explicit local reporting of disease in humans, animals, or plants that may describe epidemiological features of the event. It is important to monitor with a species-agnostic approach. Species tropism exhibited by diseases may give important initial clues as to the diagnosis. For example, prairie dog illness in Colorado is an important indicator of the possible presence (and imminent threat to human health) of plague.

Indirect I&Ws are further subdivided into additional categories such as official acknowledgement, official action, local perception of threat, business practice changes, and integrity of infrastructure. The indicators within these categories are numerous. They enable approximation, over time, of social functioning in the context of a biological event. The key objective in the use of indirect indicators is to provide an assessment of containment status and concurrent level of social disruption.

In a preferred embodiment of the invention, the information collection and processing subsystem 202 generally looks for the following three types of I&Ws in the data being collected from the information sources 104: (1) environmental conditions thought to be conducive to support outbreak triggering; (2) reports of disease outbreaks in humans or animals; and (3) markers of social disruption such as school closings or infrastructure overloads.

Table 2A below identifies typical I&Ws and associated parameters by category (Table 2B, which follows, provide more specific I&Ws related to plant disease events):

TABLE 2A Category Indication and Warning (I&Ws) and Parameters Direct Reports of human disease  Diagnostic impression   Known disease entity   Unknown/unsure disease entity  Epidemiological features   Geography   Uni-focal (city level and below)    Building    Public event    City   Multi-focal (beyond city level)    “Hopping” or non-contiguous geographic    involvement    Multiple cities or regions involved   Unique/strange clinical presentation   High morbidity/mortality   Unexpected appearance of disease in relation to season   Discrete population involved   Specific ethnic group   Nosocomial setting    Healthcare workers with unusual disease    Clinical workers    Lab workers    Patients contracting unusual disease while in    medical facility   Specific age group    Pediatric    Adult    Elderly   Occupational Reports of animal disease  Diagnostic impression   Known disease entity   Unknown/unsure disease entity  Epidemiological features   Unique/strange clinical presentation   High morbidity/mortality   Unexpected appearance of disease in relation to season   Discrete population involved   Specific species   Nosocomial setting    Animal care providers with unusual disease   Specific age group    Fetus    Abortions    Yearlings    Adults   Occupational   Specific ownership of animals    Rural    Commercial   Geography   Uni-focal (city level and below)    Building    Public event    City   Multi-focal (beyond city level)    “Hopping” or non-contiguous geographic    involvement    Multiple cities or regions involved Indirect Official Acknowledgement  Acceptance   Source of reporting   Low imperative    Local officials reporting   High imperative    Regional/national officials reporting    International reporting   Type of reporting   Acknowledgement of a biological event without   explicit declaration of an   official concern   Declaration of outbreak or epidemic   Declaration of “health alert” or emergency   Call for international assistance  Denial Official Action  Official investigation   Low imperative   Local civilian government involvement   High imperative   Military involvement   National government involvement   International organization involvement  Implementation of countermeasures   Agriculture countermeasures   Animal culling    Sheep    Cattle    Birds    Swine   Commerce and trade restrictions    Animal movement/transportation restrictions    Animal product sale restrictions    Market closures    Food kiosk or restaurant closures   Vector control measures   Mosquito spraying or fumigation   Habitat elimination   Pest control    Rodent control   Tick spraying or fumigation or cattle dipping   Hygiene campaign   Individuals    Clothing   Equipment    Farm    Medical    Industrial   Areas of public congregation    Businesses    Schools or child daycare    Medical facilities    Public transit    Places of worship   Official order for personal protection    Face masks    Gloves    Mosquito repellent or bed netting    Universal Precaution   Activation of pharmaceutical (including vaccines)   countermeasure program   Vaccines    Humans    Standard vaccines    Experimental vaccines    Animals    Standard vaccines    Experimental vaccines   Pharmaceutical recommendations   Infrastructure closure   Individual business closures   School closures   Public transit closures   Farm closures   Festival closures   Individual medical facility closure   Stock market closure   Government facility closure   Official order to quarantine   Low imperative    Individual patient plus contacts quarantine or    “house arrest”    Single animal/single herd quarantine    Quarantine of an entire farm or commercial facility   High imperative    Military or police called in for enforcement    Quarantine of an entire medical facility    Quarantine of entire city or region   Propaganda   Public awareness campaign   Border and Port of Entry closings   Refusal to allow an aircraft or ship to disembark   Closure of country's borders   Military enforcement of border closure  Activation of biosurveillance or screening   Patient screening and isolation   Businesses   Schools or child daycare   Medical facilities   Government facilities   Ports of entry    Airport screening    Road checkpoints    Ships   Animal and animal product screening   Farms   Festivals   Markets   Businesses   Ports of entry    Airport screening    Road checkpoints    Ships  Information suppression   Closure/censure of media   Censure of healthcare workers   Censure of general populace   Shifts in modes of communication  Prosecution   Scapegoating   Penalties against officials for mishandling event   Prosecution of citizens or industry for information   dissemination   Creation of hotlines to report “rumor mongering”   Prosecution of citizens or industry for profiteering   Creation of hotlines to report profiteering or “price   gouging”   Prosecution of citizens or industry for disregarding   quarantine Demand for Medical Services  Increased demand for medical care   Increased hospital visits or admissions   Demand for specialized or alternative care   Military medical care support called to assist civilian   medical providers   Creation of special ‘infectious disease wards’   Traditional healer    Shamanism  Demand for medications or supplies   Increased purchasing or use   Pharmaceuticals including vaccines   Medical supplies (e.g., face masks, gloves, gowns,   disinfectants, ventilators for healthcare provider use)   Stockpile depletion   Seeking behavior (e.g., individuals travel to additional   areas to purchase medications and medical supplies)   Innovative behavior (e.g., using bras as face masks)  Mobilization of medical resources   Mobilization of pharmaceuticals, including vaccines   Mobilization of medical supplies (e.g., face masks,   gowns, ventilators) Local Perception of Threat  Public “concern” or “anxiety” noted   Increased use of telecommunication or Internet   Divestment of pets   Heightened depression or apocalyptic feelings   Creation of psychological support centers   Reports of suicide    Reports of animal care provider suicide    Reports of farmer suicide  Hoarding and self preservation behavior   Changes in consumer behavior   Increased purchase of staples (i.e., hoarding behavior)    “Panic buying”   Pharmaceuticals, herbal medicines or home remedies    “Panic buying”   Increased purchase or use of medical supplies for   personal use (i.e., public wearing face masks)   “Panic selling” of livestock   Avoidance of purchasing animal products   Avoidance of public places   Decrease in tourism   Work and school absenteeism   Decrease in public transit use    Shifts in mode of transportation (e.g., from trains    and buses to bicycles)   Decrease in public presence in stores, festivals, and   religious sites   Voluntary evacuation  Public dissent   Public demonstrations   Public rioting   Appearance of newspaper editorials criticizing official   response   Appearance of blogs and chat rooms   Refusal to comply with government orders Business practice changes  Changes in pricing  Changes in product advertising campaigns  Black market formation Integrity of Infrastructure  Compromise   Individual business closures   School closures   Public transit closures   Farm closures   Festival closures   Pre-emptive stock market closure   Individual medical facility closure or refusal to see   patients   Medical infrastructure “strain” or at full capacity  Collapse   Multiple medical facility closure or refusal to see   patients   Industry-wide closures   Wide scale economic crisis with gross stock market   changes   Loss of access to basic staples, medications, or medical   supplies   Martial law declared   Complete social crisis declared   Complete dependence on international support Enviro- Temperature changes Climatic  Increased temperature implying enhanced vector  transmission competency  Decreased temperature (e.g., influenza-like illness) Precipitation changes  Increased   Flooding   Official Acknowledgement    Acceptance    Source of reporting     Low imperative     Local officials reporting     High imperative     Regional/national officials reporting     International reporting    Type of reporting     Declaration of emergency     Call for international assistance    Denial   Official Action    Evidence of countermeasures against flooding    Sandbagging    Drain cleaning    Mobilization of resources     Food and water supply     Water sanitation and sewage control     Sandbags     Personnel     Medical relief    Fortification of infrastructure    Buildings     Roads    Response    Refugee camp formation    Mobilization of resources     Food and water supply     Sandbags     Personnel     Water sanitation and sewage control    Call for international assistance     International organization on the ground     assisting    Use of military or mercenaries to secure supplies    and maintain civil security   Integrity of Infrastructure    Compromise    Business closures    School closures    Public transit closures     Roads     Railways     Bridges     Airports    Agricultural loss     Livestock killed     Crop destruction    Governmental facility closure    Medical infrastructure compromised     Loss of access to safe water     Work absenteeism     Shortage of medical supplies    Utilities     Electricity     Blackouts     Water works     Access to safe drinking water     Sewage system failure    Collapse    Loss of access to basic staples, medications, or    medical supplies     Famine    Agricultural loss     Livestock killed     Famine     Crop destruction     Famine    Martial law declared    Complete social crisis declared    Complete dependence on international support   Vector population changes (i.e., mosquitoes)  Decreased   Drought (e.g., meningococcal outbreaks)   Official Acknowledgement    Acceptance    Source of reporting     Low imperative     Local officials reporting     High imperative     Regional/national officials reporting     International reporting    Type of reporting     Declaration of emergency     Call for international assistance    Denial   Official Action    Countermeasures    Mobilization of resources     Food and water supply     Water sanitation and sewage control     Sandbags     Personnel    Control of resources     Food and water rationing     Movement of livestock    Response    Mobilization of resources     Food and water supply     Personnel     Medical relief    Call for international assistance     International organization on the ground     assisting    Use of military or mercenaries to secure supplies    and maintain civil security   Integrity of Infrastructure    Compromise    School closures    Agricultural loss     Livestock migration     Livestock killed     Crop destruction    Medical infrastructure compromised     Loss of access to water     Work absenteeism     Shortage of medical supplies    Utilities     Water works     Access to water    Collapse    Loss of access to basic staples, medications, or    medical supplies     Famine    Agricultural loss     Livestock killed     Famine     Crop destruction     Famine    Martial law declared    Complete social crisis declared    Complete dependence on international support

TABLE 2B Category Indication and Warning (I&Ws) and Parameters Direct Direct indicators include the traditional epidemiological alerts and markers. Examples of direct indicators include declarations of confirmed diagnostic results; specific, and/or unique or unexpected symptoms; geography; host susceptibility, season of onset; incidence, severity, crop mortality or yield loss measurements. Indirect Official Acknowledgement: Official Acknowledgement indicators involve an indigenous government communicating threat information to its citizens and trade partners (commerce). Communications may include acceptance or declaration of an event; denial or no response; and may include a recommendation for a control strategy; or a call for outside assistance Official Action Official Action indicators refer to an indigenous government's response to a plant health threat. Possible actions include official investigations; implementation of countermeasures; activation of biosurveillance, survey or screening; information suppression; prosecution of individuals, organizations or companies relating to activities in black markets, crop price gouging, pesticide profiteering, or failure of a government official to handle an event properly. Demand for Control and Countermeasures Demand for Control and Countermeasures indicators refer to shifts in supply and demand; mobilization of resources; and economic hardship resulting in farmers unable to pay for supplies and services. This category includes requests for military assistance to control pests; seeking behavior as stockpiles deplete (e.g. traveling to acquire food, seed, or pesticides); innovative mitigation behaviors (e.g. use of vinegar or bleach instead of pesticides); and calls to neighboring regions for assistance. Local perception of threat Local perception of threat indicators refer to an increase in anxiety; farmer pleas for assistance; increased suicide rates (common in India), hoarding, panic buying; loss of confidence in government; news editorials, protests, or rioting; and migration to more stable communities. Changes in business practices Changes in business practices indicators include changes in import/export patterns; changes in choice of crops grown; business closures; profiteering, black market development; new technology introductions; price increases; and changes in advertising. Integrity of infrastructure Integrity of infrastructure refers to market closures; empty store shelves; diminished pesticide, seed or other critical supplies or delays in distribution; riots; establishment of martial law; open rejection by farmers of recommended practices; widespread bankruptcy or starvation; heavy dependence on national or international support.

Direct I&Ws are used in traditional international disease reporting by organizations such as GOARN and ProMED that follow the standard epidemiological practice of initial surveillance followed by event verification. Direct I&Ws include diagnostic impression, which refers to whether the causative pathogen has been reported to be known and well understood or unknown. Diagnostic impressions are often ambiguous, particularly when an indigenous society is confronting a pathogen that is either an unrecognized or unexpected agent or is identified initially as well understood, but defies countermeasures thought to have been effective in the past.

Direct I&Ws also encompass epidemiological features. Specific parameters for this marker include unique clinical presentation, geography, high morbidity or mortality, unexpected appearance of disease in relation to season, and discreet population involved. Geography refers to whether an outbreak of disease is unifocal (described in patients or animals located at a single facility) or multifocal (described at multiple facilities or present at a regional level). It also includes descriptions of expected or unexpected propagation patterns, such as the presence of multiple foci when previous experience with a disease dictates a single-point origin for an outbreak or epidemic. Discreet population refers to a high incidence of disease observed in a specific group, such as children or health care workers. As discussed below, the presence of a disease that is multifocal and is directly incapacitating health care workers may point to a compromised medical infrastructure.

Substantial differences between direct markers of biological events affecting animals and direct markers of such events affecting humans have not been observed, except that animals tended to experience disease on farms and were cared for by veterinarians, whereas humans tended to experience disease in hospitals and were cared for by medical workers.

As summarized in Table 2A and 2B above, indirect I&Ws include official acknowledgment, official action, demand for medical services, local perception of threat, changes in business practice, and integrity of infrastructure.

Official acknowledgment refers to the behavior of an indigenous government when communicating threat information to its citizens. Three basic messages that governments send have been observed: (1) acceptance or declaration of an event, (2) denial, or (3) no response. It has been observed that public announcements by local officials indicate a unifocal biological event, while those by national officials suggest an epidemic spreading throughout a region. Types of announcements may include explicit declarations of the occurrence of an outbreak, a health alert or declaration of a public health emergency, and calls for international assistance. When a biological event has defied all local countermeasures, announcements become more emphatic and culminate in one or more requests for international assistance.

Official action refers to an indigenous government's response to a biological event. It includes official investigation, implementation of countermeasures, activation of biosurveillance or screening, information suppression, and prosecution:

-   -   Local official investigation implies a lower level of official         concern regarding a biological event than investigation         conducted by national officials or international organizations         called in for assistance, such as WHO.     -   Implementation of countermeasures covers a wide variety of         actions, such as vector control measures (like mosquito spraying         campaigns), hygiene campaigns (like official orders to disinfect         elementary school classrooms), pharmaceutical deployment (like         vaccination campaigns), preemptive infrastructure closure (like         closure of sites of public congregation, such as festivals, to         prevent disease transmission), official orders to quarantine         infected individuals, propaganda or public awareness campaigns,         and closure of borders and ports of entry to prevent the         international spread of disease.     -   Activation of biosurveillance or screening, such as screening         passengers at airports for disease, reveals a shift from passive         to active surveillance.     -   Information suppression is a frequent behavior, irrespective of         the country involved. An indigenous government's attempts to         deny the presence of a biological threat may stem from concerns         about inciting public panic, from the threat's perceived effects         on the economy and tourism, and from a lack of official         understanding of the threat.     -   Prosecution of individuals, organizations, and companies occurs         for a variety of reasons, including failure of a government         official to handle an event properly, unauthorized dissemination         of information, profiteering, and disregard of orders to         quarantine.

Demand for medical services encompasses parameters that reveal shifts in supply and demand, such as increased demand for medical care, pharmaceuticals, and supplies and mobilization of those resources:

-   -   Increased demand for medical care is indicated by a rise in         patient presentations at hospitals; demand for specialized care,         such as intensive care unit support; and calls for military         assistance to support civilian care providers.     -   Demand for pharmaceuticals and supplies includes increased         purchase or use of medications, vaccines, face masks, and         gloves. As a biological event progresses to fully defy local         countermeasures, depletion of pharmaceutical and medical supply         stockpiles is seen. If the public is aware of this stockpile         depletion, both seeking behaviors (such as travel to other areas         to find medicine) and innovative behaviors (such as using bras         as face masks) may be observed. Stockpile depletion will often         prompt calls to neighboring regions and countries for assistance         and mobilization of resources.

Local perception of threat denotes public awareness of a change in the local status quo of disease incidence. For example, social anxiety is noted when a disease appears abruptly and the public is aware that it may be defying countermeasures. Anxiety is also indicated by increased telephone and Internet use. Heightened incidence of depression or apocalyptic feelings is observed and may prompt special psychiatric intervention, such as the creation of suicide hotlines or psychological support centers. As the media's references to anxiety turn into references to panic, hoarding and self-preservation behaviors are noted, including changes in purchasing behavior (such as panic buying of staples or home remedies) and avoidance of sites of public congregation (such as subway stations, stores, and festivals). As a biological event becomes harder and harder to contain, mass voluntary evacuations are observed. The public's loss of confidence in the government's ability to mount countermeasures against the disease leads to expressions of dissent in newspaper editorials, public demonstrations, and eventual rioting.

Changes in business practice are observed when demand outstrips supply and certain items are at a premium. Changes in prices of staples and medicine are observed, as well as profiteering, changes in advertising, and formation of black markets.

Integrity of infrastructure denotes the progressive strain that occurs when a disease defies countermeasures. Compromised infrastructure is observed when hospitals are reported to be overburdened or inundated with patients, often to the point of creating new disease wards or placing patients in hallways because of a lack of space. Infrastructure collapse is observed when hospitals must close or turn patients away because an overwhelming number of patients are presenting, because resources are depleted, and/or because health care workers have been incapacitated by the disease. As an indigenous medical infrastructure progresses from compromise to collapse, a combination of multiple social disruption parameters is observed, indicating a full social crisis: declaration of martial law to preserve basic government functions, open conflict between citizens and government officials, widespread economic damage, and heavy dependence on international support. These parameters indicate maximal social disruption and a loss of ability to control or contain the disease.

It has been recognized that, over time, the identity and characteristics of particular I&Ws may vary with the disease involved and the social routines of the affected culture. Determining what constitutes an I&W thus may depend on understanding local disease baselines and social conditions. Expanded capture media reports from indigenous local sources provide valuable information because these sources reflect timely and qualitative local knowledge of typical and atypical phenomena. This kind of knowledge could be obtained by relying solely on regional news reports.

Step 508. Next, targeting analysis defines the data collection requirements, implied social networks, and information sources 104 needed to obtain the information and data of interest.

Step 510. Output reporting requirements are next identified to satisfy needs of the end users 324 (this step may also be done earlier in the process, after the mission is established, or during subsequent steps).

Step 512. Once the disease-specific and I&W reporting requirements have been established, the next step involves prioritization. For example, a report of diarrhea in a daycare center may take lower precedence than reports the primary trauma care hospital for the city is reporting a sudden inundation of its infrastructure with an unidentified influenza-like illness. Such prioritization is designed to categorize reporting requirements into classes that are easily understood by the end users 324, such as “Warning”, “Watch”, and “Advisory”. Prioritization is dependent upon, and partly defined by the end users 324, whether for internal monitoring purposes or to provide reporting to the healthcare community or the general public. Prioritized reporting requirements drive the operations tempo, where “Warning” implies a tactical, near real time reaction by the analyst versus an “Advisory” that may be monitored over the course of a week. In other words, this categorization is an easy way to impart the severity and importance of the report to the user community.

Step 514. Once prioritized reporting requirements have been established, an examination of social networking may need to be conducted to cross-match the requirements to specific information and data sources. The product of this analysis is referred to as a targeting matrix. For example, if a key direct indicator is reports of disease in rodents, then one may need to consider building a network of reporting that includes the local sanitation authority. If a key indirect indicator is local depletion of ventilator supplies, then one may need to build a relationship with the local distributor of ventilators as well as medical facilities that use ventilators.

Step 515. The next step, developing an operations plan, draws together all of the aforementioned steps into a document that defines precisely how the mission will be executed, from mission statement, to CIRs, prioritized reporting requirements, operation tempo modes, communication channels that include the social network of reporting, and so on. While the document itself eventually becomes a desk reference for the biosurveillance analyst, it is a “living” document that is modified to reflect changes in operational requirements and refinements in the analytic process over time.

Turning now to FIGS. 6A and 6B, shown therein is an information and workflow diagram that illustrates the steps in the process after those described immediately above.

Step 602. The system 200 currently targets local, national, and international open source media information sources 104. Because those information sources 104 may be defined geographically, the world is broken into countries and regions for operational surveillance purposes: Southeast Asia, East Asia, Central Asia/Caucuses, Middle East, Eastern Europe/Baltic States, Oceania, Western Europe, Africa and Central/South America (a tenth region, North America, is currently not under biosurveillance). The system 200 includes, in one embodiment, nine regional teams consisting of thirty-six analysts 318, that conduct and report on disease occurrences worldwide. There are nine senior analysts 320 for each region, which supervise and assure quality of the analysts' 318 work. A chief analyst 322 oversees the overall analytical quality. If necessary, a daily situational awareness report based on reports from the nine regional analytic teams may be generated.

Step 604. It is important to select analysts 318 who have an understanding of the specific countries or geographical regions in which biosurveillance is to be conducted, because cultures within those countries and regions have unique languages, customs, political structures, communications infrastructures, and other unique properties and factors that may not be apparent to analysts 318 who are less familiar with the specific countries/regions under surveillance.

Step 606. A checklist for identifying information sources 314 may be used once a mission has been identified. The checklist may include categories defined by public health, demographic, historical, socio-economic and social mobilization parameters, as follows:

-   -   The public health category emphasizes capturing public health         websites, agricultural, community-based health-related         internet-based sources, official medical, health, information         dissemination or other agencies responsible for the provision of         basic human and animal health services. Collection of         information from these information sources 314 is the first         stage of examining an area of interest;     -   The demographic category includes country-specific guidance on         emerging demographic trends, including the appearance of new         ethnic communities or “imagined communities” based on migration,         economic self-identification, and other social mobilization         networks;     -   The historical category includes identification of zones of         known biological activity, previous hot zones of conflict or         social instability, as well as guidance on identifying existing         physical infrastructure. In addition, former or current reliance         of a country on a specific foreign state or non-state entity,         capable of establishing zone(s) of influence within a particular         country is provided. Internet sourcing related to social         mobilization networks and zones of historical, demographic, and         socio-economic activity will be most active during an ongoing         event and in the post-event response;     -   The socio-economic category emphasizes micro-economic changes,         which may influence the emergence or denial of service to new         sources appearing within a community. Understanding major         micro-economic players in the region presents an opportunity to         examine the emergence of (1) physical infrastructure, such as         sponsored hospitals, pharmacies, or traditional herbal clinics,         and (2) new communities based on micro-loan giving in the area         of interest.     -   The cross-reporting category identifies cross-reporting trends         due to economic, cultural, ethnic or other interests between         countries. For example, Russian cross-reporting due to         significant Russian ethnic migrant population in South East         Asia, the Former Soviet Union and the Middle East, specifically         in Egypt and Israel, is provided;     -   The alternative source category includes web 2.0         environment-generated blogs, public forums, semi-private forums         and lists, which allow increased specificity in targeting a         particular population or area of interest.

An analyst 318 may suggest an information source 104 to be included on the document source index 402 by entering the URL of the information source 104 into a template (described later). The template is an intake form that is used by programmers to develop scripts to capture information and data from the proposed new information source 104.

Step 608. The list of information sources 104 to be crawled is updated and stored in the appropriate database.

Step 610. In order to easily search all of the documents downloaded from the information sources 104, the documents are preferably encoded to a common format. Thus, once an information source 104 is identified, a unique script is written for that information source 104 that captures the title, publication date, and “body” of the published files that are available from the information source 104. If the web document is an HTML file, it is relatively easy to locate the title, date, and body from the metadata associated with the file.

Step 612. For some information sources 104, all of the new/updated information and data that are available may be downloaded and then searched at a later time. For example, if an RSS feed is used, the feed provides all the new/updated information added to a website, regardless of whether it is relevant to the specific I&Ws/parameters. In the case of a website without RSS, the site is crawled according to a pre-determined schedule, as previously described.

Step 614. The unique script associated with a particular information source 104 is executed to extract the relevant data for subsequent analysis purposes.

Step 616. The extracted data are then encoded into a new format and saved to a new file. The encoded file format is preferably UTF-8 encoding. It is contemplated that other encoding formats could also be used, such as a markup language, or it could be unformatted text, or other format. The original language of the document is preserved; thus, if the downloaded information or data is in Spanish, the encoded file will also be in Spanish.

Step 618. If the original information and data are in a language other than English, the title of the downloaded web document is automatically machine translated into English if machine translation service is available for the particular language. It is contemplated that manual translation may also be used.

Step 620. The encoded file is then saved in the article repository database 310.

Step 622. Once a new XML file is added to the article repository database 310, the document index 404 is updated to include the words associated with the subject and body elements of the new XML file, making subsequent searches of the keywords within the new XML file possible.

Other sources of information may be accessed manually, and include, but are not limited to, websites available from ProMED, GIDEON, WHO, CDC, and OIE, to name a few.

Step 624. If a particular country or region is under surveillance, a ranked list of relevant documents may be updated for analysts to easily identify the level of activity in an area of interest.

Step 626. The next sequence of processes in FIG. 6B, identified as connector nodes B and C, are conducted in parallel or in series, and will be described below. Once those sequences are complete, the process returns to a decision step, in which the system 200 determines whether additional steps are required to be executed. If not, the system 200 is terminated. If additional steps are warranted, the system 200 branches back to one of several earlier steps as indicated by connector node D in FIG. 6A.

FIGS. 7A and 7B are an information and workflow diagram for connector node B shown in FIG. 6B.

Step 702. The system 200 automatically executes the RAMBO alert generation and modeling module 306. No manual intervention is required, except to input modeling parameters and model initial conditions, as necessary. The alert generation and modeling module 306 analyzes the information and data in the article repository database 310, and outputs one or more ranked lists of articles of potential interest.

Step 704. The module 306 may be set up to analyze a particular event, city, region, country, or other geographical limit. The module 306 was developed using Bayesian casual models as an information retrieval mechanism for events associated with complex keyword clustering. The module 306 provides a “back-stop” mechanism whereby analysts 318 who are busy searching at the more sensitive, individual keyword level (described above and below) do not fail to notice a larger, more complex evolving event. This is an operational safety mechanism to ensure that “Stage 4” biological events of possible transnational threat potential are not overlooked.

Step 706. The module 306 scans all encoded files that have been downloaded from the information sources 104, and identifies those that are relevant to a particular event, city, region, country, or other geographical limit provided for in the previous step. This is an automatic routine that may be executed as soon as new data are downloaded from the information sources 104.

Step 708. The module 306 maintains an index of the top ten (or more or fewer) downloaded documents for a particular area of interest based on keyword hits. The module 306 issues an email notice 307 to the analysts 318, which includes links to the top documents so that the analysts 318 may open the files and review them.

Step 710. The module 306 is based on Bayesian probability, which interprets “probability” as “the degree of belief (or strength of belief) an individual has in the truth of a proposition.” The analysts 318 establish a new proposition, i.e., the probability of occurrence of a biological event at any given moment based on the most recent data being downloaded from identified information sources 104. The propositions may be of the form of increasing or decreasing degrees of belief in the occurrence of a biological event such as a plant, animal, or human disease outbreak. The modeling makes use of custom software developed by The MITRE Corporation to implement the Bayesian modeling, and some tool-specific data management and configuration services. Currently, the system 200 includes several Bayesian models in multiple languages running in parallel or in series.

Step 712. The Bayesian model uses data obtained from queries of the documents to drive “evidence nodes” in the model, which are used to detect the presence or absence of certain conditions reported in the news articles.

Step 714. If certain evidence is found, that drives a “decision node” to conclude that there is or is not sufficient evidence to support an intermediate or final conclusion regarding the aforementioned proposition.

Step 716. The module 306 outputs email alerts if anomalies are detected for a given location being monitored based on the data obtained from the information sources 104 relevant to that location. Those email alerts may be separate or part of the aforementioned email notices 307. An anomaly may be detected based on simple rules, e.g., an increase in the number of keywords found in relevant documents obtained from information sources 104 above a threshold value, or percentage increase above an established baseline.

Step 718. When an analyst 318 clicks on one of the email links, a web application is opened which provides an interface for the analyst 318 to view the actual web documents containing the keywords of interest for the model found to be relevant to the aforementioned proposition.

Step 720. The results of the model are verified and validated by reviewing the top web documents identified by the model as suggesting whether a new or developing event is actually unfolding or not. Usually, this initial filtering step forms part of the daily event reporting by the analysts 318.

Step 722. If the model does identify web documents that suggest that a new or developing event is unfolding, the analyst 318 may estimate a probability of such occurrence based on experience and historical information about the area of interest. Thus, the analyst 318 may conclude that there is a high probability of an unfolding event based on his or her review of the top ten documents.

Turning now to FIG. 8, shown therein is an information and workflow diagram for connector node C shown in FIG. 6B. After analysts 318 refer to the Bayesian model outputs that prioritize articles for review, the analysts 318 use the “Newsstand” document search and reporting interface module 308 as a second filtering step to identify relevant event-related information.

Step 802. If the “Newsstand” document search and reporting interface module 308 is used, the web application is first launched.

Step 804. The “Newsstand” document search and reporting interface module 308 can execute pre-established saved searches automatically, or manually. New searches may also be created and saved. The searches target both direct indicators, such as disease names or words like “outbreak,” and indirect indicators, such as “panic”, “shortage”, or “black market.” For example, a search for information and data relating to avian influenza might use the following query, either entered as a new search or saved as a previous search that is re-run:

-   -   H5 OR H7 OR HPAI OR “avian influenza” OR “bird flu” OR “avian         flu” OR influenza OR flu OR “H5N1” OR “dead bird” OR “bird         die-off” OR pandemic OR “influenza-lie” OR “avian flu” OR         pathogenicity OR “human-to-human” OR person-to-person” OR         tameflu OR pneumonia OR oseltamivir

Other common searches may be saved for the following events: “laboratory accident”, “biological weapon use”, “ventilator depletion”, “disease/healthcare workers”, “unknown disease”, “flooding”, “natural disasters”, industrial accident”, and for the following I&Ws: “quarantine”, animal die off”, “declared event”, “disease symptoms”, “official acknowledgement”, etc. (see Tables 1 and 2 for I&Ws that may identify parameters, or more specifically keywords, for use in searches).

Step 806. Keyword searches may be scheduled to be execute on any frequency established by the analysts 318. Scheduled saved searches are run by Moab, a job-scheduling and management package targeted at a cluster environment. The trigger for a scheduled saved search is an event in the EVENT_SCHEDULE table that corresponds to a saved search. The scheduled saved search is run at the scheduled time and date on one of the cluster nodes, and results are saved in the database for the user. Log files are kept for each run. Only the last two results are saved. FIG. 9 shows an interface template 902 for setting up the scheduled search. The template 902 includes fields for search day, time of day, search terms (Boolean or otherwise), sources to be searched, language, display, timeframe for documents, and userid.

Step 808. Saved and scheduled searches are executed according to the EVENT_SCHEDULE table criteria.

Step 810. The web application then returns a list of search hits to the search results user interface 410.

FIGS. 10A through 10C are an information and workflow diagram that illustrates the steps in the process after those described in FIG. 8. The series of steps begin with the “Newsstand” document search and reporting interface module 308 running. The module 308 includes a search user interface 408 and a search results user interface 410. The search user interface 408 is shown in FIG. 11, and includes a “Saved Searches” tab 1102, a “Search” tab 1104, a “Clipboard” tab 1106, a “Discus” tab 1108, a “Messages” tab 1110, and a “Results” tab 1112. The “Saved Searches” tab 1102 displays all the searches that the analyst 318 has run.

The search user interface 408 allows analysts 318 to form ad hoc search queries separate and apart from the automated searches and models described above and shown in FIG. 8. Queries allow an analyst 318 to search stored documents for keywords and more complex search terms, e.g., concatenations of keywords. An analyst 318 can specify which language to search as well as date ranges for the search and geographical areas for the sources of the document. An analyst 318 can also specify certain document sources to search or one or more countries on which to target the search. In addition, the “Search” tab 1104 allows an analyst 318 to specify the number of results to return, and it provides a graph of documents (typically news articles) over time as well as a comma-separated-value spreadsheet (not shown). An analyst 318 can also save a search to be run at other times, as described previously. Search term translation is provided for certain languages.

The entire system 200 depends upon (1) the validity of the taxonomy and (2) the ability of individual analysts 318 to continuously examine a very large body of media-produced web documents from the information sources 104 to accurately identify events of interest. Because an inherent limit exists on the number of documents in the article repository database 310 that a human can process accurately in a given period of time, the detection of events is limited by the number of analysts 318 working on a particular region. Although analysts 318 may accurately, precisely and consistently identify events, the events so identified will probably contain events that are mundane (i.e., “normal”) and not of immediate interest. In addition, analysts may miss some “abnormal” events of high immediate interest in the “Clipboard” tab

The “Clipboard” tab 1106 allows an analyst 318 to save particular documents from a search across research sessions.

Step 1002. The “Results” tab 1112 allows an analyst 318 to retrieve documents corresponding to an item in a results list from the article/document repository database 310. An analyst 318 may access the document source information by language, country, area, region, and alphabetical order. The URL of the information source 104, captured frequency information, and other information are provided. FIG. 12 shows a portion of the “Results” tab 1112 for a particular search involving the query 1202. The search results are displayed with links to documents along with date, source name, and source country.

Clicking on a document link retrieves the document from the article repository database 310 and displays it in another window 1300 as shown in FIG. 13. The displayed information will include the title (“Headline”) 1302, source name 1304, publication date 1306, a translation link 1308, a URL link 1310, and the main body 1312 of the web document. Clicking on a header will sort the data using that column's data entries as the sort key.

Step 1004. The translation link 1308 is available for each stored web document if translation into English is available for the language of the document.

Step 1006. If translation is needed and available, it may be demanded by an analyst 318 (translations may also be done automatically). This translation is a front-end access to the machine translation gateway (MTG) subsystem, which is part of the “Newsstand” document search and reporting interface module 308. The MTG is designed as a gateway, offering a standardized interface for users and applications that masks the complexities of one or more off-the-shelf machine translation engines well known in the art. An analyst 318 makes use of translation services using a standard protocol, and is not required to know the specific configurations of various supporting translation technologies. The requesting application submits information to be translated to the MTG subsystem, which then forwards the request to a Translation Control and Scheduling component (not shown) of the MTG subsystem. This component is responsible for determining which of the translation resources is most appropriate for the request, and also for maintaining the schedule and status of the various translation engines. Once the appropriate engine has been selected, a request is sent to the engine for translation service. When the translation is complete, the Translation Control and Scheduling component prepares the response message and forwards it to the MTG interface. The interface prepares and sends the response to the calling application.

Step 1008. Based on the results shown in the “Results” tab 1112, an analyst 318 may want to create and save additional searches to better clarify existing event-related information. FIG. 14 shows a template 1400 under the “Search” tab 1102 that allows an analyst 318 to enter the parameters for the new search. The template 1400 includes fields for entering a search description/title 1402, search query terms 1404, documents to be searched 1406, date limiters 1408, display options 1410, and source geography limiters 1412. Analysts 318 scan hundreds, possibly thousands, of web document article titles and text daily. Analysts 318 may search by region, country, or specific information source 104. Display options 1410 display a list of titles, or as a graph of hit counts over time. Search query terms 1404 may follow normal Boolean syntax.

The analysts 318 consider whether the information and data from an information source 104 accurately reflect the original website article, whether the event was expected, whether the event is important, and what is the tone/quality of the information and data. Analysis is done with the following premises in mind: 1) any major emerging biological event will disrupt everyday social life of individuals, communities, and institutions in affected areas, and 2) types of description linked to outbreaks of disease are similar across countries/cultures.

Step 1010. Based on event information, and the reporting requirements previously identified for a particular mission, an analyst 318 viewing the search results will determine the priority of reporting. Typically, this will involve identifying whether event reporting will be designated under one of the “Advisory,” “Warning,” and “Watch” prioritization categories (or they may be the colors yellow, orange, or red, or some other color, pattern, or indicia). An Alert value may be assigned to the country level (although other geographical or political levels may also be used).

Other information sources 104 are available to the analysts 318, such as Google® Alerts, which allow a user to request alerts that can be sent to a user's email address. This application is available for many countries and languages, and may be set up for frequent alerting (“as-published” or “daily”). Still other open source information sources 104 are also available and are checked frequently in a manual fashion because they are not set up in a way that allows the system to automatically download information and data. These information sources include the Ministries of Health and Agriculture, Flu Trackers, ProMED, OIE, and others.

Step 1012. Next, an analyst 318 will compare the event information to the biological taxonomy for the particular mission to determine the stage of the event using a heuristic scale or staging model. The present heuristic scale describes the state of biological events according to a heuristic model as described below. The basic framework for the system 200 is that it follows a biological event as it evolves from unifocal, to multifocal, to uncontained, and finally to a state that induces social collapse. On the whole, the model's first two stages (1 and 2) are reminiscent of public health definitions of biological events, whereas the final two stages (3 and 4) are based on sociological definitions. The following descriptions focus first on I&W stages for biological events that affect humans, and then present caveats for biological events that affect animals and plants. Obviously, I&Ws for the former diseases impact medical services and those for the latter diseases affect veterinary services. For zoonotic pathogens (those that affect both animals and humans), I&Ws exist that affect both types of service.

Stage 0 (Environmental Risk Conditions Present or Potential Antecedent to a Biological Event). Stage 0, indicative of a potential increased risk for disease, is a pre-event condition that applies to specific mosquito-vectored and waterborne diseases in certain areas of the world, such as RVF in Kenya, VEE in Venezuela, and diarrheal illness in India and Bangladesh. It has been found that in most of these cases, local public and government officials appeared to respond to a history of disease following both expected and unexpected large-scale seasonal floods. This response included public awareness campaigns, active disease surveillance, and mosquito spraying.

One important feature of the Stage 0 condition is the impact of excessive flooding on medical infrastructure and the ability of the indigenous community to respond to the subsequent emergence of disease. This impact was observed during the 1995 epidemic of VEE in Venezuela, and the 1997-1998 epidemic of RVF in East Africa. It has been observed that if the local infrastructure was already compromised prior to the appearance of a rapidly transmissible disease, the associated social disruption might proceed at a much faster pace.

It has been proposed that Stage 0 is relevant for Kenya, Venezuela, India, and Bangladesh. Stage 0 is likely to be relevant for other areas of the world as well; however, analyses to date have been limited to case studies.

Stage 1 (Unifocal Biological Event). This stage represents the beginning of an identified biological event, when human cases of a disease have presented to a single medical facility and the event has therefore appeared as a unifocal phenomenon. This condition is analogous to the epidemiological term “outbreak,” which denotes the unusual appearance of a disease, above baseline but limited in scope. The present model simplifies this definition to mean the appearance of cases at a single medical facility, and thus reflects the uncertainty of dealing with media sources rather than information vetted by the public health community.

Direct I&Ws include reports of human disease that describe clinical impression and epidemiological features of the event. Indirect I&Ws, which are not substantial in Stage 1, fall into two categories: official acknowledgment and official action. Official action is limited and typically includes official investigation and low-level countermeasures, such as vector control campaigns (like mosquito spraying).

For biological events that affect animals, the basic description is the same except that explicit reporting originates in official and unofficial communities linked to veterinary medicine, agriculture, industry, field biology, and forestry, as expressed to local media. Locations where such events occur include veterinary, research, agricultural, and industrial facilities; park lands; and local communities. The staging of such events is similar to that for human disease, with the event being reported to occur at one such facility. A key difference, however, is the implementation of countermeasures. An official order to quarantine the involved animal or animal herd may be reported. If a pathogen with serious economic repercussions, such as foot and mouth disease in cattle or H5N1 avian influenza, is suspected to be present, mass culling of animals may be mandated for both the involved facility and surrounding areas. For the current situation with A/H5N1, this stage may include military support for mass culling of animals in developing nations.

In summary, Stage 1 represents a unifocal biological event with limited countermeasures, minimal demand for medical services, and minimal local perception of threat. As an example, on Oct. 18, 2005, Bashkirstotan village in Russia reported a human case of anthrax. A 67-year-old female resident of Kovarda village in the Gafur region was hospitalized with the cutaneous form of the disease. This report is a single case and thus represents a Stage 1 event.

Stage 2 (Multifocal Biological Event). In Stage 2, disease is present in multiple facilities, and is analogous to the public health definition of an “epidemic.” Stage 2 therefore describes a phenomenon that is multifocal but contained (as perceived by the indigenous society). Early I&Ws in this stage include reports of increased demand for pharmaceuticals and medical supplies.

As the biological event progresses, direct I&Ws may include more epidemiological details. However, it is at this point that concern is expressed publicly about whether the initial clinical impression was correct. This step takes place if the pathogen in question is a well-known one that is now displaying new transmission characteristics, or if the initial clinical impression was wrong, and a newly emergent pathogen is present.

Indirect I&Ws include elevated official acknowledgment, with national in addition to local officials making statements regarding the event. Official action progresses to include activation of biosurveillance or screening of individuals and attempts to isolate those infected. Under the category of official action, official investigation progresses to involve national and occasionally international organizations. Countermeasures expand to include disinfection campaigns, vaccine deployment, and official treatment recommendations, reflecting a reactionary posture by the indigenous government to exert control over the situation. Increased demand for medical care, pharmaceuticals, and supplies also appears. Stage 2 sees the first documentation of local perception of threat, with the public reported as being “anxious” or “concerned” about the disease.

When animal populations are involved, a “multifocal” event occurs in more than one of the locations described for Stage 1, above. A key epidemiological feature for Stage 2 events involving animals is documentation of which animal species are involved. This information is important for determining the possible identity of the pathogen. If the event involves a suspected pathogen of economic consequence, more aggressive and widespread animal culling is observed, frequently with military support. Additionally, domestic and international trade restrictions may be implemented by both the host nation and other countries. An official order to quarantine an entire affected facility may be issued. Active screening of animals and animal products at farms, festivals, markets, businesses, and ports of entry is also seen. Disinfection of individuals, equipment, and transportation vehicles is observed, along with roadblocks and checkpoints.

As an example of a Stage 2 event, a biological event of H5N1 avian influenza in poultry was confirmed at three farms in Huainan, a city in Anhui province, People's Republic of China, according to China Central Television (CCTV). The biological event was first reported on November 6, when 800 domestic poultry died. Approximately 126,000 poultry within 3 kilometers (2 miles) of the affected area were slaughtered as a precaution, CCTV reported. Five days later, China confirmed 21 avian influenza biological events in nine provinces, autonomous regions, and municipalities, as reported by Yin Chengjie, Vice Minister of Agriculture, in a news conference in Beijing. Yin said that the avian influenza biological events had killed 144,624 poultry, while a further 21,184,200 had been culled. He also warned that the H5N1 situation in the country was “severe.” This series of reports depicts a multifocal biological event with heightened government countermeasures, characteristic of a Stage 2 event.

Stage 3 (Uncontained Multifocal Biological Event or Infrastructure Strain). Stage 3 denotes an uncontained multi-focal event that affects the medical infrastructure to the point of strain. Key I&Ws include declaration of the inability to contain or control the disease in question, depletion of vaccine and drug stockpiles and medical supplies, and concerns about whether medical facilities can continue to handle patients. If the disease entity continues unabated, social collapse is imminent.

Direct I&Ws escalate to include health care workers becoming infected, which may occur with such pathogens as influenza or SARS. This marker is an ominous sign, as it indicates a serious potential threat to the continued functioning of medical facilities.

Further diversification of indirect I&Ws is noted in Stage 3. Official acknowledgment includes declaration of a health emergency and requests for international assistance. Official action is increasingly aggressive, with international organizations featuring more prominently in official investigations. Such action includes preemptive infrastructure closures (of schools, festivals, and public transit, among others), official orders to quarantine, initiation of propaganda campaigns, and border and port-of-entry closings; information suppression and prosecution of citizens and organizations appear as well.

Also in Stage 3, demand for medical services diversifies. There is increased demand for specialized, innovative, or alternative medical care. Demand for pharmaceuticals and medical supplies progresses to the point of stockpile depletion and subsequent mobilization of resources from neighboring regions. Local perception of threat evolves and leads to new behaviors, such as hoarding and self-preservation (like panic buying of staples); avoidance of sites of public congregation; and expressions of dissent toward officials whom the public now deems incompetent in their handling of the event. Changes in business practice reflect market responses to a severe shift in the supply-and-demand ratio, as local medical resources become depleted. Price gouging and formation of black markets are seen. Integrity of infrastructure is observed to be compromised, with medical facilities reporting “strain” or seeing patients at “full capacity,” implying few reserves left to treat the public. Multiple schools and businesses close on account of widespread illness—not because of an official mandate to control its spread, but rather because the disease is directly affecting these institutions. These parameters are signs of severely compromised social functioning.

In Stage 3 events involving animals and/or the military's participation in countermeasures is more widespread and common. Preemptive infrastructure closures include individual businesses, farms, festivals, and the stock market. Local perception of threat is manifested in panic selling of farm animals at risk of exposure to the disease, panic buying of animal products in anticipation of a market restriction, and avoidance of public places. Public dissent may be observed as both the disease itself and official countermeasures, such as animal culling, begin to have a severe effect on the livelihood of the agricultural community. A compromise in the integrity of infrastructure is noted with the closure of individual businesses, farms, and festivals; these closures are not preemptive, but rather stem directly from the pathogen itself, from attendant countermeasures, and from social anxiety. Severe economic damage for the host nation is imminent.

As an example of a Stage 3 event, multiple reports in Indonesia reflected a strained veterinary infrastructure that can no longer respond effectively to an avian influenza crisis starting in January 2004. For example, “A senior official at the Jakarta Animal Husbandry, Fisheries, and Maritime Affairs Agency, Adnan Ahmad, said it was impossible to respond immediately to every report because local animal husbandry agencies had only three to four veterinarians to cover the entire regency. He did acknowledge, however, that several officials at local animal husbandry agencies in Greater Jakarta had failed to make avian influenza their top priority since the national campaign to eradicate the disease was launched by President Susilo Bambang Yudhoyono in September.” Additional reporting revealed the magnitude of the animal epidemic: “Mathur Riyadi, Director General of Veterinary services stated that the avian influenza biological event has affected 21 out of 30 provinces in Indonesia with the number of chickens killed by the virus totaling 9.53 million. Avian influenza has been found in at least 132 regencies and cities across Indonesia. Additional investigation also indicated preliminary attack of avian influenza in four other provinces, namely South Sulawesi, Jambi, East Kalimantan and North Sumatra, he said on the sidelines of a hearing with legislators at the parliament compound.” This is a multifocal biological event with strained veterinary infrastructure, and is indicative of a Stage 3 event.

Stage 4 (Maximally Disruptive Biological Event or Infrastructure Collapse). Stage 4 is the regional- or national-level end point for a socially disruptive biological event, when social collapse results from the disease's sustained defiance of countermeasures and maximal social disruption occurs. Key I&Ws of a Stage 4 event include conflict due to public outcry over the handling of the event, mass evacuations, and refusal of the medical infrastructure to see patients.

Stage 4 is defined principally by indirect I&Ws. Official action is maximally aggressive, with preemptive infrastructure closures occurring systematically across multiple sectors (such as schools, festivals, and public transit). Systematic closures and redirection involving the entire local medical infrastructure are noted, with medical facilities becoming wholly devoted to treating those with the disease. Mass quarantines of thousands of people may be seen. Information suppression and prosecution of citizens and organizations escalate to arrests and threats of capital punishment if control measures such as quarantine are violated. Medical facilities may also be punished for refusing to see patients.

Demand for medical care results in a total depletion of pharmaceuticals and medical supplies; military transport of materiel on a national level, with mobilization of internationally acquired supplies, may be observed. Absenteeism is observed among health care workers and first responders due to disease, fear, or both. Consequently, dependence upon military medical support reaches maximum levels.

Local perception of threat appears as an abject lack of public confidence in the government's ability to handle the situation. This lack of confidence takes such forms as open defiance of official mandates, like quarantine or curfew; rioting; and open conflict between civilians and law enforcement or military personnel. Psychological impact may be observed, with signs of mass depression, anxiety, and apocalyptic and suicidal ideation. Counseling hotlines and support services may be inundated with requests for assistance. People thought to have the disease may be ostracized. Patients may avoid medical facilities for fear of receiving inadequate care, being placed in isolation or quarantine, or becoming infected by the pathogen. Conversely, medical facilities may be at the point of requiring systematic security, both to hold back the panicked public and to prevent health care workers from leaving.

Business practices evolve to the limit of tolerable price gouging. Emergency price controls on pharmaceuticals and basic staples may be broadly implemented.

Integrity of infrastructure reaches the point of collapse at the societal level, with martial law having become the mechanism for maintaining order. Important social functions, such as weddings, are canceled. Incoming and outgoing international air traffic is terminated, as are diplomatic visits by representatives of other countries, and nonessential foreign diplomatic personnel evacuate. Indeed, it is at this level that ostracisms of the affected country by the rest of the international community may be observed.

During Stage 4 biological events involving animals, extreme measures may be taken to assist with animal culling, such as having prisoners perform this task. An entire city or region may be officially ordered into quarantine, with military or police enforcement. Prosecution of citizens or businesses for unauthorized dissemination of information, profiteering, or disregarding a quarantine may be observed. Maximal local perception of threat is reflected in public demonstrations, rioting, and refusal to comply with official orders. Broad, sector-wide, systematic collapse of infrastructure occurs, including industry-wide closures, widespread economic collapse with gross stock market changes, loss of access to basic staples, declaration of martial law, declaration of a complete social crisis, and total dependence on international support.

Because a disease that is focused primarily in animals generates less social anxiety than one that is actively incapacitating and killing humans, a Stage 4 biological event involving animals tends to generate less social disruption than one involving humans. For example, mass evacuation and panic are typically not seen in a Stage 4 event that involves animals.

In summary, a Stage 4 biological event entails maximal regional- or national-level social disruption and a complete loss of containment. An example of a Stage 4 event is the SARS outbreak in Guangdong, People's Republic of China, 2003. In April 2003, official acknowledgment no longer involved denying the presence of SARS in Guangdong; rather, its presence in areas previously rumored to be infected was confirmed. Official action involved maximal enforcement of countermeasures and criminal prosecution for hoarding, speculation, and illegal manufacturing of medical supplies, antiviral medications, and general staples. In addition, official action sought to provide sustained medical relief for SARS patients. The government prosecuted health care workers and medical facilities for refusing to treat SARS patients and prevented medical staff from leaving hospitals by posting guards at hospital entrances and exits. In some cases, the police arrested runaway SARS patients and mandated home confinement of all people who came in contact with patients confirmed to have the disease. Demand for medical services was indicated by reports of army medics mobilizing to fight SARS in Beijing, and of airlifts bringing medical supplies to stockpile-depleted areas. These reports also focused on heightened local perception of threat, noting multifocal rioting and voluntary evacuation, worker protests, discrimination against workers with SARS, various efforts at self-preservation, and innovative behaviors. Finally, reports of hospital closures in SARS-affected areas and temporary closures and work suspensions at national financial and historical institutions suggested infrastructure collapse. Additionally, temporary suspension of marriage licenses in Beijing demonstrated that SARS was gradually eroding the very fabric of social life in China.

Stage P (Preparatory Posture). Countries often assume a preparatory posture in anticipation of a biological event. Sometimes, this posture implied the presence of an event when there was none; other times, this posture anticipated the presence of an event that was confirmed once active surveillance measures were implemented. It was found that, particularly in closed societies, a declared preparatory posture for a biological event was a possible mechanism for disinformation, to explain the abrupt appearance of countermeasures and social disruption related to the actual presence of a disease. Therefore, Stage P is proposed to represent a preparatory posture that reflects anticipation of a biological event. Stage P indications and warnings fall solely in the indirect category because presumably no biological event is actually present yet. However, if an actual event occurs during the course of preparation and is later reported by the host nation, Stage P may be changed to Stage 1 or 2.

Indirect I&Ws during Stage P include official acknowledgment, official action, demand for medical services, and local perception of threat. Official acknowledgment typically takes the form of an explicit official announcement that preparatory measures are required because of concern about an imminent biological threat. Official action includes official investigation, implementation of countermeasures, and activation of biosurveillance or screening. Prophylactic countermeasures include vector control measures; hygiene campaigns; activation of pharmaceutical programs, such as vaccine delivery; preemptive infrastructure closures; quarantine of inbound international flights and ships from affected areas; public awareness campaigns; and closings of borders and ports of entry. Demand for medical services reflects anticipation versus actual demand, and includes restructuring of the medical infrastructure to accommodate a potential influx of patients requiring specialized care, as well as mobilization of pharmaceuticals and supplies. Local perception of threat is indicated by public concern or anxiety, with dissent occasionally being reflected in media articles that criticize the lack of an appropriate state of preparedness.

With regard to animal disease, prophylactic countermeasures include animal culling in locations where a pathogen may be introduced from a neighboring country. Commerce and trade restrictions may be implemented to prevent pathogen introduction. Preemptive infrastructure closures may include farms. Quarantine may be implemented for a farm or facility, for freight transport, or for inbound international air flights or ships, to enable screening. Demand for veterinary services is observed as an anticipatory condition similar to the preparatory posture for human disease. Local perception of threat regarding animal disease appears as changes in consumer behavior, specifically avoidance of the purchase of animal products.

An example of a Stage P event is avian influenza in the Democratic People's Republic of Korea (DPRK), April-November 2005. In early April, a report surfaced that South Korean officials had intensified preventive measures against avian influenza in the Demilitarized Zone (DMZ) on the border with North Korea. The same report also noted that South Korean officials were stepping up their monitoring of avian influenza at a tourism site in North Korea's scenic Mount Geumgang resort, which attracts South Korean tourists. In September 2005, the official DPRK news agency reported that, despite the State Emergency Veterinary and Anti-epizootic Committee's assurance on July 5 that avian influenza had been eradicated, the government continued to implement avian influenza prevention efforts, such as surveillance of migrant movements, a reporting network, and the development of a vaccine. Finally, in mid-November 2005, a report surfaced that quarantine efforts on the border and in other terminals of the DPRK had been intensified to prevent avian influenza. The report noted that people and goods coming from or via H5N1-affected regions and countries were subject to special measures: those with symptoms of disease were being quarantined, and imports of poultry goods from those areas were strictly banned. These reports indicate that preparatory measures were being taken in response to avian influenza. Since there were no reports of actual disease, the event can be designated as Stage P.

Stage R (Recovery). This event stage is used to characterize societal actions related to recovery from an event.

Table 3A below summarizes the heuristic staging model scale that is described in detail above (Table 3B summarizes the heuristic staging model scale related to plant disease events).

TABLE 3A Stage Condition P Preparatory posture 0 or A “Potential Antecedent to a Biological Event” Environmental conditions favorable to support the appearance of a biological event (for specific diseases and locations only- see text) 1 Unifocal biological event 2 Multifocal biological event 3 “Infrastructure Strain” Severe infrastructure strain, depletion of local response capacity 4 “Infrastructure Collapse” Social collapse at the regional or national level R Recovery

TABLE 3B Stage Condition A Potential Antecedent to Plant Event Crop or forest loss NOT Reported, even if pest is present; Ex: public awareness campaigns, import bans, measures to prevent pest entry or outbreaks, pest interceptions at the border, crop loss not reported but anticipated due to drought, forecasts of reduced crop production 1 Unifocal plant event Crop or forest loss reported in a single village Ex: pest outbreak destroys crops on several farms in a single village, drought affecting crops in a single village. 2 Multifocal plant event Crop or forest loss reported in multiple villages, or a political division that can reasonably be expected to contain multiple villages. Ex: pest outbreak update, official response to multifocal outbreaks, a plant disease affects a third of the crops in a valley known to contain numerous villages, a drought affecting multiple villages and crop loss due to drought is reported. 3 Uncontained plant event Crop loss reported and there are indications of social strain due to the crop loss Ex: food shortage, food aid requested, or food aid sent because a pest outbreak, drought, or flood has destroyed crops 4 Maximally Disruptive Plant Event Crop loss reported, leading to social disorder Ex: industry-wide business closure, financial collapse, mass migration, food riots, looting of grain stores, collapse of government R Recovery Past Crop loss may be reported, along with report of recovery from effects of previous crop loss Ex: official statement of recovery, reports about crop disease events after they have occurred and their impact has ended, report of people returning to farm land after a massive outbreak has ended and the food supply has been restored

Note, Stage A replaces and merges Stage 0 and Stage P.

Applying the above model, it has been found that several pathogens (e.g., plague, SARS, RVF, and VEE) tended to generate high levels of social disruption in certain countries. In a prospective analyses for May 1-Jul. 15, 2005, influenza-like illness was the most common event detected. It generated widespread social disruption, consistently reaching Stage 3 across multiple countries with medical infrastructures of varying sophistication. It was found that 14 percent of the events involving influenza-like illness progressed from Stage 1 to 2, and 60 percent from Stage 2 to 3. Of a total of 50 biological events detected for the Pacific Rim (involving 12 diseases in 15 countries), 58 percent were detected at Stage 1, 26 percent at Stage 2, 16 percent at Stage 3, and none at Stage 4. The retrospective analyses detected a variety of biological events, several of which had reached an advanced stage (3 or 4), as shown in Table 4 below.

TABLE 4 Maximum Stage Level Disease Reached Brucellosis 1 Avian influenza (human disease) 2 Japanese encephalitis 2 Typhoid 2 Dengue fever 3 Diarrheal disease 3 Influenza-like illness 3 Malaria 3 Meningococcal meningitis 3 Polio 3 “Swine Flu” Influenza A/H1N1 3 Plague 4 SARS 4 Rift Valley fever 4 Venezuelan equine encephalitis 4

Certain I&Ws are cause for particular concern. It is believed that they should be among the most important targets for surveillance, given that they may indicate the presence of a biological agent that is novel, is highly transmissible, causes high morbidity or mortality, and/or is defying countermeasures. Consequently, it has been proposed that the following parameters of critical I&Ws require rapid verification:

Implied Parameter Stage Example Illness or death among health care or 1-4 SARS, influenza, veterinary workers emerging zoonotic disease Military medical or veterinary 1-4 SARS, influenza, support emerging zoonotic disease Depletion of stockpiles of 3 SARS, influenza, VEE, pharmaceuticals or medical supplies RVF Regional or national mobilization of 3 SARS, influenza, VEE, medical resources RVF Mass evacuation and panic 4 Plague, Ebola hemorrhagic fever Rioting and martial law 4 SARS, pandemic influenza

The heuristic staging model and numeric scale above is suggested for use when conducting biosurveillance missions; however, other scales may be more useful for biosurveillance and other types of event monitoring. The United States National Weather Service has used various models to describe the disruptive and destructive potential of storm systems. For example, the Saffir-Simpson Hurricane Scale classifies hurricanes on a scale of 1 to 5, which describes progressively severe sustained winds and potential to cause serious damage upon landfall. Similarly, the Fujita Scale rates the damage inflicted by a tornado on a scale from F0 to F5, from light to heavy damage. For both of these scales, the frequency of storm systems is generally inversely proportional to the magnitude of the rating; in other words, milder storms are observed more frequently than severe ones. Those and other scales could be used to represent the stage of an unfolding event of interest.

Step 1014. The next step involves reviewing all of the saved reports, as well as outside sources, to connect multiple media reports and contextualize the event being reviewed. This is a prelude to creating and saving an event report that covers a human, non-human animal, or plant disease event (other events are also possible, as indicated previously). Outside sources of relevant information, as mentioned above, may include, but are not limited to, those published by academia, ProMED, GIDEON, WHO, CDC, and OIE.

Step 1014. The “Watchboard” search and reporting module 316 preferably outputs different types of reports at different levels of users, such as summary event updates, country or regional reports, and global situational reports. An analyst 318 begins the reporting process by launching a web application to generate and save an event report. FIG. 15 shows the basic template 1500 for starting the event reporting process. Dropdown menus 1502 allow an analyst 318 to select the category of the event (i.e., biological), the country/province, a specific reporting disease, plant entity, and whether to include past events. A “create a new event” button 1504 activates a new report window 1600 as shown in FIGS. 17 and 18.

The “Watchboard” search and reporting module 316 includes, as shown in FIG. 16, a menu 1604 with options: “Home”, “Reports”, “Tools”, “Training”, “Administration”, and “Work List”. Under “Tools”, the following options are selectable from a drop-down menu: “Free-Text Search”, “Graphing Tools”, “Source Stats”, “Air Traffic Data”, “File View and Upload”, and “File Upload”. Under “Source Stats”, a user may select to see “My Source Page”, which brings the user to the source management screen 1902 as seen in FIG. 19, where the user may add new information sources 104 and check the status of sources already added. The Source Management screen 1902 shows information sources 104 that have been “scripted” (which, as mentioned previously, involves manually programming the system 104 to “pull” information and data from the information sources 104 in an automated manner). The Source Management screen 1902 also shows information sources 104 that cannot be scripted for automatic downloads (flagged “cannot”). Adding new information sources 104 involves entering the source name, URL, country, province, language, city, encoding type, circulation, local-regional-national focus, type of site confidence, and comments. Encoding type may be, for example, “western (150-8859-1)” or “Unicode (UTF-8)”, etc., which are selectable from a drop down list.

Because the “Newsstand” document search and reporting interface module 308 is not able to handle PDF files, flash websites, and certain other formats, the analysts 318 may use really simple syndication (RSS) aggregation to download information and data from those information sources 104, or they may manually download and review those types of information and data.

Step 1016. The new report window 1600 (FIG. 16) includes a list of saved reports 1602 satisfying the criteria selected from the dropdown menus 1502. An analyst 318 may select an existing report to update, or may create a new event report.

Step 1018. If an analyst 318 wishes to update an already existing event report, the report is opened by clicking on the link provided with the list of save event reports 1602.

Step 1020. If an analyst 318 wishes to create an entirely new event report, he or she may open a new window 1700 as shown in FIGS. 17 and 18 and enter the text of the event report in a text field 1702, and the metadata and staging information in fields 1802. The analyst 318 may also enter a staging value at this point.

Step 1022. When the event report is saved, a signal is sent to a worklist que 2000, as shown in FIG. 20, which is updated to reflect the new or updated event report. That signal changes a status record in the appropriate database to reflect that the new event report is “pending” and needs to be reviewed by a senior analyst 320.

Step 1024. The worklist que 2000 includes a list of country-level reports 2004, a list of submitted event reports 2006 for review, and a list of in-progress or completed event reports 2008.

Step 1026. A senior analyst 320 reviews the worklist que 2000, and selects an event reports by clicking on the provided link. This pulls up the event report in a new window 2100, as shown in FIG. 21, and displays the report text 2102 prepared by the analyst 318. A field 2104 is provided in which the senior analyst 320 may enter comments, questions, or other information. Selection buttons 2106 for “Submit Changes,” “Request Revision,” and “Disapprove” are provided for activation.

Step 1026. Depending upon which selection button 2106 is activated by the senior analyst 320, a signal is then sent back to the worklist que (not shown) associated with the analyst 318 who created the event report.

Step 1028. If the “Submit Changes” button is selected, the event report is approved and the senior analyst 320 may assign an alert level value and/or a staging level value to the event (or confirm the staging value assigned by the analyst 318), which is added to the appropriate database.

Step 1030. A senior analyst 320 may prepare a country-level report (or other report, such as regional-level or at some other geographical limit) by selecting link 2002 (FIG. 20). The country reports are processed, updated, and added to the appropriate database in a similar fashion as the event reports, except the interaction is between a senior analyst 320 and a chief analyst 322. FIGS. 22 and 23 show a reporting template 2200 and 2300 for the senior analyst 320 to use for this purpose. The reporting template 2200 is used to enter factual data, whereas the template 2300 is used to provide an analyst's insight and opinion, explains why a particular agent or circumstance is being monitored, addresses ambiguities or lack of information in the media reporting, may include media reports that are older than a pre-determined time period, may include information from other sources, and allows analysts to draw conclusions and conjecture about, for example, high priority events.

Step 1032. A chief analyst 322 worklist que (not shown, but similar to the worklist que 2000) is updated and displays to a chief analyst 322 a list of country-level reports submitted for review by the senior analysts 320 (other geographical or political levels may also be used, such as reports for specific regions within a country or across multiple countries).

Step 1034. A chief analyst 322 reviews the worklist que, and selects an event report by clicking on a provided link.

Step 1036. Selection buttons are provided to the chief analyst 322, who may click on “Submit Changes,” “Request Revision,” and “Disapprove,” as applicable. Depending upon which selection button is activated by the chief analyst 322, a signal is then sent back to the worklist que 2000 associated with the senior analyst 320 who created the country- or regional-level event report.

Step 1038. Where necessary and requested, the senior analyst 320 then updates the event report that he or she authored.

FIG. 24 is an information and workflow diagram of the back-end information communications subsystem 206, where end users/customers 324 are provided access to various event-related reports and some of the information and data stored in the databases of the present invention.

Step 2402. End users 324 may have access to, and may generate, a number of different displays of event-related information, event stages and alerts information, and an updated master event list or index, among other displays. For example, users 324 may filter and display a master event list. Because many of the display tools offered in the information communications subsystem 206 include location-based and GIS-related information, the system 200 includes non-source-related information and data that are downloaded into the system 200 and stored, and then made available to internal and possibly external users. For example, it may be necessary to download (or upload) GeoNet Name Server (GNS) data, GIS mapping data, and other data into the system 200. When a selection is made using one of the GIS mapping tools, an end user 324 may click on an icon and receive the latest updates from the selected region on a map that displays some of the downloaded data.

Step 2404. An Advisory Board is made available to end users 324, which may be in the form of a list of country- or regional-level event reports 2500 grouped by alert levels 2504, 2505, 2506, as shown in FIG. 25. Each report shown is associated with a hyperlink 2508 that may be activated by clicking on the event report name. The list 2500 is continuously being updated as new event information is received and analyzed.

Once one of the event report hyperlinks is activated, a new window will appear with the event report displayed therein. Figure FIG. 26 is an example of a hypothetical country-level event report for Algeria, and shows the basic information 2602, highlighted information 2604, current alert events 2606, air traffic data information link 2608, and notes 2610.

The air traffic data link 2608 opens up a new window (not shown) or expands the current view to provide for additional information that is filled in on the window being displayed (after being retrieved from a database). Such information may include the number of passenger trips to the U.S. and other locations, a list of cities where direct flights departed and arrived, various trends, and maps depicting routes of air travel. Sources of information may include federal agency transportation statistics and aviation organizations.

The Advisory Board that is accessible to the end users 324 may also be in the form of a map 2700 displayable on a client computer used by the end user 324, with individual icons 2702 on the map 2700 grouped by color-coded alert levels 2704, as shown in FIG. 27. The event world map shown in FIG. 27 represents, geospatially, two types of reports: country reports and event reports. Country reports are represented on the world map by red, orange, and yellow static and flashing indicia, such as icons or symbols (e.g., diamond-shaped icons), and summarize the most important human/animal, plant, and enviroclimatic events that are taking place in a given country, and are based on event-level reporting. Event reports are represented on the world map by static yellow and green indicia, such as icons or symbols, and summarize any event relating to human, animal, or plant health that falls within the reporting requirements of the specific event under surveillance. For example, yellow icons could represent human/animal events, and green icons could represent plant events. Icons may change colors to indicate that they represent multiple categories of event types.

The event world map may be created using Microsoft® Virtual Earth, which provides for scrolling and zooming, 2-dimensional and 3-dimensional views, road, aerial, and bird's eye views, and the ability to label portions of the map.

Each icon 2702 on the event map may flash to indicate an event detected within the last 48 hours (or some other time period). A non-flashing icon 2502 indicates that the event is being tracked. Each flashing icon is also a hyperlink that, when activated, causes a new window to appear with the associated event report, as previously shown in FIG. 26, to be displayed therein. Also, mousing over any one of the flashing icons 2702 causes the event and other information to be displayed, such as the country or province name, title of the event, date and time of last update, the full text of the report, and a link to the full text of the report (here, FIG. 27 shows the title “H5N1 Avian Influenza”). The option to select specific types of reports to view is presented using check boxes, as are specific event types under surveillance, so that only certain events are displayed on the world map. Similarly, a date range may be selected so that only events posted within a specific time period are displayed. Also, a drop down menu (not shown) may be used to quickly zoom to a specific region and only those events associated with that region will be displayed. Computationally, these options are treated as filters on the information stored in databases associated with the system 200. When a selection is made to zoom into a country, point data are displayed to indicate specific city locations (center point of the given geographical regions) of events. Selecting the option to cluster events will display related events (typically related geographically). Selecting a time-line animation will display events on the map in the sequence in which they were posted during a specified time period, providing the end user with information about the propagation and spread of events. Human and other animal staging information is similarly displayed on the map display.

The event map and various displays thereon may be generated in a client computer browser in data communication with a network, such as the Internet or a local area network, from instructions received from an event server, which may be one of or the document collection servers 302 or one of or the document processing servers 304, or both or a different server (not shown).

Step 2406. Another display tool, generally used by analysts 318, 320, 322, is a government off-the-shelf open source program called NASA Worldwind, as shown in FIG. 28. This application is launched as a separate desktop application, and pulls data from the system 200 to populate the overlays, which may include staging and alert information pointing to specific locations on the globe. The Worldwind platform, which has been substantially modified and is called “GeoCenter” in connection with the present invention, provides advanced GIS capabilities and access to real-time environmental data from throughout the world. It is a geospatial visualization tool that allows for multiple data layers to be viewed in geographic context. It has an interface that allows the users 324 to display disparate data sets in relation to the events information to provide enhanced situation awareness. FIG. 28 shows various event-related information for a portion of South America.

The GeoCenter application has file, edit, view, tools, and plug-in menu options. Data layers include imagery data, such as high-altitude Earth images, MS Visual Earth plug-in data, NASA Landsat high-res satellite images, ZoomIt! Data for selected countries, and OnEarth WMS daily MODIS satellite imagery. Other data layers include Argus Situational Awareness (SA) tool ring map (shown in FIG. 28 as an overlay), which displays near real-time Watchboard reports geospatially as mentioned above, and country and enviro-climatic data from CIA World Factbook, placenames, boundaries, and aeronautical information, Naval Research Laboratory (NRL) weather-related data, MODIS imagery, and Global Clouds data. Annotations may be added to the displays.

Step 2408. Prior to accessing the information communications subsystem 206, the end user's 324 login credentials are verified against values stored in one of the databases of the present system 200.

Step 2410. Once access is provided, one or more of the display tools described above is made available. In addition, other tools may be made available, and the end user 324 may create his or her own displays using the data from the information communications subsystem 206, including map displays. FIG. 29, for example, is a timeline 2900 displaying influenza-related event information for China and other countries. The manually-created timeline using desktop applications displays various individual events, and the changing alert levels, using biosurveillance data from the information communications subsystem 206. In this case, the timeline was generated from data and shows an “Advisory” on Jan. 4, 2007, a “Warning” on Jan. 21, 2007, and then several “Watches” at subsequent dates during the biosurveillance period (ending approximately August 2007, in this example).

FIGS. 30A and 30B are exemplary charts of summary and statistical data available from the information communications subsystem 206. The particular data shown are from a measles event case file, which was created using data obtained during the first half of 2007. In this case, a bar chart showing the number of keywords counted in data obtained from information sources 104 relating to a measles event collected during the period Jan. 1, 2007, through Jul. 6, 2007, broken down on a per-country basis is displayed. Alternatively, the number of events per disease (not shown) could be displayed on a separate chart.

FIG. 31 is an exemplary summary chart that may be manually generated using data available from the information communications subsystem 206 or from other sources. In this case, the figure shows time series traces of the number of I&Ws detected in the data obtained from various information sources 104. In this case, the charts were prepared using commercial off-the-shelf software called Tableau. Here, the line charts are related to influenza, where the individual y-axes are counts of keywords associated with parameters under each of the I&Ws shown. Individual time series graphs for each I&W may be summed together as shown in the top chart.

Another chart that could be displayed is a line chart that shows the progression of alert information over time and/or the progression of human and animal staging information.

Turning now to FIG. 32, shown therein is an information and workflow diagram of another aspect of the present invention involving detecting, analyzing, and communicating information about emerging threats in the civil violence context.

Step 3202. The emerging threat workflow involves first identifying incidence of interest, including those affecting or relating to infrastructure, economics, food and water security, proliferation of weapons of mass destruction (WMD), cyber-security, improvised explosive devices (LEDs), human trafficking, and narcotics, among others. Each event can be described by a theme, storyline, event, and action. For example, in the case of Pakistan insurgency, the following types of events or categories of events could be identified as being relevant to subsequent research and analysis: Military-Taliban Clashes; Police-Taliban Clashes; Other Military Offensive; Clashes Involving International Actors and the Taliban; Legal Actions Related to Taliban; Border Security; Taliban Attacks on Civilians or Government Targets; Government (civilian, military or police) Preventative Measures Against Insurgents; Government or International Aid; Taliban Propaganda (including threats) and Government Counter-Propaganda; and the like.

An action is a descriptive code that consists of a subject, verb, object, properties and attributes. Analysts create the action codes from the actor, verb and noun lists contained within the codebook, described below. The code definitions describe the intended use of the codes and what they can and cannot represent. Actions are the most essential elements in conducting quantitative analysis of civil violence, because they function as easily disaggregated data points. Actions are entered into the system by researchers who have surveyed the daily output of articles from Newsstand and other sources and have located relevant articles from which to cull important I&Ws of political instability or other emerging threats. Broken down into its constituent parts an action consists of: one or more Subjects, only one Verb, one or more Objects, zero to several Attributes if applicable, and Properties, i.e., Always a Location, and Dead, Wounded and Participating, when known. Examples include: <Police> <Arrested> <Ethnic Group>; <Military> <Overthrew> <President>. Simple Actions are descriptive codes that consist of a subject, verb, object, properties and attributes. A simple action is an act, deed, or something performed that is objectively presented in a relevant article. Simple actions are pieces of information accepted as fact.

A subject is the actor performing the action. A subject is classified according to predetermined taxonomic categorizations, all of which can be found in the codebook and are listed on the researcher interface. A subject can be Human or Non-Human. A Human subject can be either: Civilian, Government, International. There can be more than one subject in an action code. A subject can be further described through use of a Proper Noun (using the Proper Name function on the interface). A Proper Noun/Proper Name is the actual name of either an organization or an individual, e.g., President would be the taxonomic category of the Actor; the Proper Name could be Barack Obama.

A verb is the act which the subject is performing upon the object. Verbs can be used to express relationships between actors AND verbs can be used to express relationships between an actor and an object (that is not an actor). There are two important subcategories of verbs: (Regular) Verbs and Statement Verbs. Some verbs necessitate an actor as the object, while some verbs necessitate a noun as the object, while a third group of verbs can have either an actor or a noun as the object. In some cases, a verb has no object. The interface will permit or deny the ability to use certain words as objects according to their definition. The taxonomic grouping for Verbs is Violent (2.1), Non-Violent (2.2), and Statement (2.3). Those broad categories may be grouped under taxonomic subcategories, which roughly group similar verbs together, such as: Violent, Jurisprudence, Aggression, Movements, Administrative, Political, General, Statements. Examples include: <Attacked>, <Violated>, <Increased Quantity (Actor)>, <Increased Quality (Actor)>

An object is the person or thing that the subject is acting upon/performing. Objects can be a thing, person, or matter, which are listed as <actors> or a <noun>. An object can be either: an actor OR a noun (NOTE: “nouns” denote inanimate, things and not individuals or associations). Examples include: Aid, Laws, Civilian, Government; all actors and nouns.

An actor is an individual, group, or non-human phenomenon that is able to perform an act. Actors are divided into several conceptual categories: Civilian, Government, Non-Human, International. Actors are generally divided into two broad categories: Human and Non-Human. Human actors are subdivided into: Civilian, Government, International and Non-State Actors. Non-human actors are subdivided into: Medical, Environmental and Economic. Actors can function as the subject or object of an action. Actors have corresponding Nation tags, which denote their country of origin, e.g., <China> <Ethnic Group>. Examples of actors include: Unrecognized State, Gangs, Labor/Unions, Prime Minister, etc.

A noun is a general term for all words that are inanimate objects or abstract things. A Non-Human actor is not considered a noun. “Nouns” can only be objects; they can never be subjects of a code. Nouns are grouped under the following taxonomic subcategories: Attack Types, Weapon Types, Authoritative Action, Government Types, Infrastructure Types, Peace Types, Property Types. Most nouns are identified by the person, group and nation that possesses them. Nouns may be divided into categories, for example to highlight conceptual and functional differences. Nouns are generally a place, thing, or quality that functions as the object of a verb.

An event is an aggregation of related action codes confined to one 24 hour period in one national subdivision (province, state, federal district, etc.), all of which concern similar participants. Examples include: In South Waziristan, Military Kills 20 Militants; In Kabardino-Balkariya, Parliament Passed a Compromised Land-Reform Bill

A storyline is an aggregation of related events. Storylines serve as an organizational tool to arrange events in a way that is useful for users and to help elucidate the most prominent ‘stories’ within a particular ‘Nation of Interest.’ Examples include: “Coalition” Authorities Arrest and Detain Georgians; Religious Issues in Georgia

A theme is an organizational mechanism under which similar stories can be grouped together and reviewed by the client. Examples include: Political Instability—General; Nuclear Security—Proliferation

A statement is a comment made by actors that announce, assign culpability for, deny responsibility for, threaten, request, support or condemn past or future actions. Statement verbs are tagged with a time component that specifies if the comment is in reference to a future (anticipated, will occur) or past (completed, has occurred) action. A distinction is made between statements and actions because actions have definitively occurred or have been acted upon. Statements, on the other hand, are inherently different because they may not be substantiated or carried out as promised. Statements indicate a behavioral pattern by actors. A statement prefix is the part of the code that comes before the main action code. A statement prefix is made up of: an actor who is making a statement and a statement verb describing what type of statement he is making. The main action code is made up of the general action structure: Subject+Verb+Object; the main action is the action about which the actor is making the statement. The main action code is the “heart” of the action. Statement actions highlight announcements of, accusations of, denials of, requests for, support for and condemnation of, past or future simple actions. Statements are classified into Announced, Accused, Denied, Requested, Threatened, Supported, and Condemned categories. Announced statements are statements by an actor declaring that they have performed or will perform an action. Accused statements are statements by an actor that assigns culpability for an action to another actor. Denied statements are statements that either refutes an action or an actor's responsibility for an action. Request statements are statements by an actor asking another actor to perform an action. Threatened statements are statements by an actor declaring that an action will take place if certain pre-conditions are met. Supported statements are statements or official declaration by one actor backing an action made by another actor. Condemned statements are statements or official denouncement by an actor of an action take by another actor. Statement verbs come equipped with a time component indicating whether the actor making the statement is referring to an action that has already occurred or an action that will occur in the future. An actor who is making a subjective assessment of an action that has already occurred will be coded using a statement verb that contains the “-Past” suffix. Any action that has not yet occurred is always considered a statement. These actions are coded using a statement verb that includes a “-Future” suffix.

Properties are pieces of information that are supplementary to the basic description of an action and are not contained in the simple (subject-verb-object) code. They also provide valuable context and nuance to action codes. For example, proper name, position (e.g., rank that an individual occupies within an organization), number dead (e.g., a numeric value indicating the number of people who died during the course of a single action), number wounded (e.g., a numeric value indicating the number of people who were wounded during the course of a single action), number participating involved in an action, and location (e.g., village, town, city, region or country in which an action occurred). Properties are concrete data related to the source article, action duration, and quantity of participants.

An attribute is a piece of information that is supplementary to the basic description of an action and is not contained in the simple (subject-verb-object) code. Attributes provide valuable context and nuance to action codes. An infinite amount of attributes help provide additional detail about an action. There are two broad categories of attributes: First-level—Objective Information (provides additional physical details to the action code; e.g., first-level attributes can specify the weapons used in an attack); and Second-level—Subjective Information (provides subjective context and rationale for an action; i.e., to answer the question, “what was the underlying cause of or motivation for the action?”). The corruption attribute is meant to specify what type of corruption occurred. Corruption is usually used in conjunction with the verb+object code <Violated> <Laws>. Corruption categories include: Electoral, Political, Financial, and Judicial.

Step 3204. Once a particular emerging threat is detected, open source electronic media sources are crawled and “scraped” or “mined” or otherwise searched for relevant information and data about the incidence of interest. The information and data are downloaded. The information may include words, numbers, indicia, quotes, excerpts, images, records, code, scripts, and the like, in any format. An example of information would be an article published on a website downloaded as text in a suitable text file format such as .DOC or .PDF.

Step 3206. The downloaded information and data are processed, including, but not limited to, by ranking the information and data, and/or clustering the information and data, as necessary. Part of this can be done manually, and part can be done automatically. If manually, the downloaded information and data may be skimmed for pertinent information and data.

Step 3208. The information and data are used to form a set of codes from a selectable plurality of codes stored in a codebook (database). This process essentially dynamically and objectively classifies the information and data using various codes so that common or related events have common or similar code values or indicia, making searching and analysis easier. The codebook of actions is a list of objective classification terms that may be related to the information and data of an incidence of interest. The classification terms cover the actors, their action, objects, and properties related to the incidence of interest. The codebook is a dictionary of terms used in the coding process of articles. The codebook is not the same type of dictionary as the Merriam-Webster or Oxford Concise Dictionary. The terms in the codebook have been specially crafted. Some words do not retain the same meanings in normal English as they do in the parlance of emerging threats. Therefore, it is necessary to refer to the definitions in the codebook when coding an action. The codebook defines all of the terms that a researcher encounters when using the coding interface. The code definitions below the codes describe the intended use of the codes and what they can and cannot represent.

Coding follows the following basic approach: first, identify the subject (i.e., translate the subject into the proper taxonomic code); second, identify the act being performed by the subject upon the object (i.e., describe the act using the most pertinent taxonomic code); and three, identify the object (i.e., translate the object into the proper taxonomic code). Table 5 includes a codebook for emerging threats according to the present invention.

TABLE 5 Codebook 1 ACTORS 1.1 Human any individual or organization capable of performing or receiving an action 1.1.1 Civilian any individual or organization whose primary characteristic does not include involvement in the government or government activities 1.1.1.1 Business any individual, organization or company engaged in commerce 1.1.1.2 Displaced Persons any individual or group who cannot live in their homeland due to extenuating circumstances, such as violence, persecution or natural disaster 1.1.1.3 Ethnic Groups a group of people who have similar racial, historical or linguistic characteristics Examples: Tajiks (Central Asia), Flemish (Belgium), Uighurs (China) 1.1.1.4 Gangs a group of actors whose primary goal is to engage in organized criminal activities Examples: Mungiki (Kenya), Mafia/La Cosa Nostra (Italy), MS-13 (Central America) 1.1.1.5 Labor/Unions a group of actors who are either unionized or associated through a professional community with the goal of obtaining improved working conditions, benefits and compensation Examples: Solidarnosc (Poland), AFL-CIO (USA), International Trade Union Confederation 1.1.1.6 NGO an advocacy organization that functions with no participation or representation from any government; the definition includes groups that function locally, nationally and internationally *Note: In cases where the organizations are funded by the government, NGOs maintain their non-governmental status by excluding government representatives in its membership Examples: Amnesty International, Save the Children, Medecins Sans Frontieres 1.1.1.7 Other Civilian any civilian who cannot be assigned a more specific “civilian” code 1.1.1.8 Paramilitary a group of actors organized to serve as a proxy military force or as supplementary military force Examples: UAC (Colombia), ZANU-PF youth militia (Zimbabwe) 1.1.1.9 Political Party a group of actors organized in support of a candidate or organization with the goal of attaining and maintaining political power Examples: PPP (Pakistan), United Russia (Russia), PRI (Mexico) 1.1.1.10 Press/Media any individual or group who gathers information for the public Examples: journalist, newspapers, blogs, etc. 1.1.1.11 Professionals any individual whose vocation is founded upon specialized training Examples: lawyers, economists, financial analysts, doctors, etc. 1.1.1.12 Prisoner an individual who is currently in prison or being detained by police and/or security forces 1.1.1.13 Rebel Group a group joined together in the name of a particular cause (usually political independence from the government) and whose primary tactics involve military force Examples: Taliban (Afghanistan), Lord's Resistance Army (Uganda), Movement for the Emancipation of the Niger Delta (Nigeria) 1.1.1.14 Religious Groups a group of adherents to a particular religion or sect Examples: Sunni Muslims, Catholics, Hindus 1.1.1.15 Students/Youth anyone attending school or university; any adolescent or group of young people 1.1.1.16 VIPs an individual who possesses great influence or prestige, but does not maintain an official role in politics Examples: First Lady, former politicians, children of political leaders, etc. 1.1.2 Government the organization, machinery, or agency through which a political unit exercises authority and performs functions; a group that has the authority to sign and enforce agreements, laws and policies 1.1.2.1 Central Government the centralized or central organization or agency through which a political unit exercises authority and performs functions 1.1.2.1.1 Executive Other any member of the branch of government that has responsibility for the daily administration of the state bureaucracy 1.1.2.1.1.1 President an elected official serving as head of state of a presidential system 1.1.2.1.1.2 Prime Minister the chief executive of a parliamentary government 1.1.2.1.1.3 Minister an individual who has a leadership position in a government ministry 1.1.2.1.2 Judiciary any member, constituent part, or the entire legal court system of a state 1.1.2.1.3 Legislature any member, constituent part, or the whole legislative branch of government 1.1.2.2 State Security Organs the security apparatus of a state, including military, intelligence and police 1.1.2.2.1 Border Guards a group whose primary goal is to protect the state's border from an external threat; the definition includes customs authorities Examples: Border Agency (UK), FPS (Russia), BSF (India) 1.1.2.2.2 Intelligence/Security Services an information gathering security service, such as intelligence, military intelligence, counter-intelligence, and counter-terrorism organizations Examples: ISI (Pakistan), Shin Bet (Israel), Al Mukhabarat Al A'amah (Saudi Arabia) 1.1.2.2.3 Military any individual, constituent part, or the entirety of a state's army, navy or air force 1.1.2.2.4 Police any individual person belonging to, part/unit of, or the entirety of a state's law enforcement service 1.1.2.3 Regional/Local Government a regional or local organization or agency through which a political unit, accountable to the central government, exercises regional or local authority 1.1.2.4 Government Owned Corporation/Agency a legal entity created by a government to undertake commercial or business activities on behalf of the government, including companies where the government owns a controlling stake; includes quasi-governmental organizations, corporations, businesses and agencies where the entity is under the guidance of the government 1.1.2.5 Government Other any government official or institution who cannot be assigned a more specific “government” code 1.1.3 Other entitites, organizations or individuals who do not fit in appropriately under the appellations “civilian” or “government” 1.1.3.1 International Agency any international organization or institution 1.1.3.1.1 IA Diplomatic any international organization whose goal is to foster diplomatic relations between states 1.1.3.1.2 IA Financial any international organization whose goal is to help manage the global financial system and/or provide financial assistance to states Examples: International Monetary Fund, World Bank, World Trade Organization 1.1.3.1.3 IA Humanitarian any international organization whose goal is to provide various types of aid to countries in need Examples: UNICEF 1.1.3.1.4 IA Military any international organization that constitutes a system of collective defense and encourages collective military action Examples: NATO forces, African Union forces 1.1.3.1.5 IA Peacekeepers any international organization whose goal is to provide troops, monitors and other resources to establish and maintain peace in a region Examples: MINUSTAH (Haiti), UNMIT (East Timor), UNTSO (Middle East) 1.1.3.1.6 IA Judiciary an international court Examples: International Court of Justice, International Criminal Court, International Criminal Tribunal for the Former Yugoslavia 1.1.3.2 Non-State Actors an individual actor or group acting independently of a state government 1.1.3.2.1 Mercenaries/Militia soldiers who take part in a conflict for monetary or other gain 1.1.3.2.2 International Corporation a company that operates in at least two countries Examples: Coca-Cola, De Beers, Credit Suisse 1.1.3.2.3 Transnational Terror Organization any individual or organization that uses acts of violence against either civilians or a foreign government with the aim of intimidating the general population or influencing policy decisions by a foreign government Examples: Al Qaeda, Lashkar-e-Toiba *Note: Domestic terror organizations would be classified as ‘gangs,’ ‘paramilitary’ or ‘rebel groups’ 1.1.3.2.4 Unrecognized State a political entity that has a degree of de facto control over territory and is not recognized by the United States as a legitimate, independent state Examples: South Ossetia (Georgia), Nagorno-Karabakh (Azerbaijan), Transnistria (Moldova), Western Sahara, Somaliland, Transnistria 1.1.3.3 Unknown an individual or organization whose identity is not known 1.2 NonHuman any actor that is neither a human being, nor an organization, nor an entity 1.2.1 Accident an unforeseen or natural event or circumstance leading to harm, danger or death Example: car accident, fire, airplane crash 1.2.2 Natural Disaster a natural phenomenon that causes physical damage to structures and/or endangers the local population Examples: floods, hurricanes, earthquakes, etc. 1.2.3 Food Crisis an inadequate supply of food 1.2.4 Water Crisis an inadequate supply of clean, potable water 1.2.5 Health Crisis a significant disease outbreak or a situation during which there is a lack of medical staff, supplies and medications 1.2.5.1 Animal Disease a disease affecting either wild or domesticated animals, or both Examples: anthrax, Swine influenza, Venezuelan hemorrhagic fever 1.2.5.2 Human Disease a disease affecting human beings Example: influenza epidemic, cholera outbreak, HIV/AIDS 1.2.5.3 Plant Disease a disease affecting plants Examples: wheat rust (ug99) 1.2.6 Economic Crisis a situation during which the economy of a country experiences a significant downturn Examples: falling GDP, low liquidity, hypersinflation 2 VERBS 2.1 Violent 2.1.1 Attacked when a human actor kills, injures another actor, attempts to kill or injure another actor, or uses force against another actor; *Note: ensuing physical damage to buildings, structures, etc. need to be coded with the ‘damaged’ verb 2.2 NonViolent 2.2.1 Jurisprudence 2.2.1.1 Arrested when one or more human actors are taken into custody by a country's security forces 2.2.1.2 Investigated a formal inquiry or fact finding mission 2.2.1.3 Litigated Against a legal suit against an individual, organization or governing body 2.2.1.4 RuledAgainst a formal ruling against a human actor in a court or other judicial body 2.2.1.5 RuledInFavorOf a formal ruling in favor of a human actor in a court or other judicial body 2.2.2 Aggression 2.2.2.1 Damaged a human actor destroys or vandalizes, in part or in full, a non-human object *Note: requires a noun (property/infrastructure types) as its object 2.2.2.2 Overthrew a forced change in leadership of a governing body without elections or appointment and accomplished through a non-legal process 2.2.2.3 Seized a human actor obtains an object that was not previously in their possession; looting and stealing are included in this definition *Note: actors cannot be “seized,” they may only be attacked 2.2.2.4 SpiedOn a human actor engages in covert information collection *Note: requires a human actor as its object 2.2.2.5 Barricaded blocking a building, transit route or another location in order to prevent access 2.2.2.6 Demonstrated Against when an individual or organization protests against a group or about an issue 2.2.3 Movements 2.2.3.1 Moved a human actor legally moves him or herself, goods or a third party from one location to another *Note: <moved> is not followed by any object 2.2.3.2 MovedForced a human actor leaves or reaches a location against his free will, including fleeing to avoid future harm; the definition focuses on the movement of internally displaced persons (IDPs) and refugees *Note: the object refers to the person who has been moved 2.2.3.3 Smuggled a human actor illegally transports people or goods from one location to another 2.2.4 Administrative 2.2.4.1 Banned an actor's partial or full restriction of an individual or organization's ability to perform tasks, congregate, etc; definition includes the forced dissolution of a group, political party, business, etc. 2.2.4.2 Approved an actor's approval of something, most often a GovernmentType noun 2.2.4.3 Terminated an actor's negation of an agreement or policy 2.2.4.4 Violated when an actor contravenes an agreement, policy or law 2.2.5 Political 2.2.5.1 Appointed a human actor is placed in an office or position by a governing authority *Note: needs a human actor as object 2.2.5.2 Dismissed the firing, removal or resignation of an actor from his or her position; the diplomatic expulsion of an actor from a country *Note: needs a human actor as object 2.2.5.3 Elected a human actor is voted into a position through an electoral process *Note: needs a human actor as object **Note: election results as recorded as follows: <OtherCivilians> <Elected> <Candidate> 2.2.6 General 2.2.6.1 Decreased Quantity a human actor lowers the amount of an object, actor or action *Note: Does not apply to monetary transactions (see “purchased” and “sold”) 2.2.6.2 Decreased Quality a human actor lowers/reduces the quality of an object, actor or actions *Note: Does not apply to monetary transactions (see “purchased” and “sold”) 2.2.6.3 Increased Quantity a human actor raises the amount of an object, actor or action *Note: Does not apply to monetary transactions (see “purchased” and “sold”) 2.2.6.4 Increased Quality a human actor raises/enhances the quality of an object, actor or action *Note: Does not apply to monetary transactions (see “purchased” and “sold”) 2.2.6.5 MetWith a human actor holds a discussion with another human actor 2.2.6.6 Occurred an action takes place that has a non-human actor 2.2.6.7 Purchased providing money for the acquisition of goods or a service 2.2.6.8 Sold receiving money in exchange for goods or a service 2.3 Statement 2.3.1 Announced a statement about an action made by the actor or representative who performed or will perform the action; the statement cannot contain any preconditions regarding its implementation 2.3.2 Accused a statement that assigns culpability for an action; the individual making the accusation and the individual performing the action must be different 2.3.3 Denied a statement that either refutes that an action occurred or refutes an actor's responsibility for having caused an action 2.3.4 Requested a human actor asks another human actor to take action; the definition includes demands issued by a group 2.3.5 Threatened a statement declaring that an action will be taken if certain preconditions are met; the focus on preconditions differentiates the definition from “announced” 2.3.6 Supported a statement or official declaration backing the action of another actor; a prominent official in the government or civilian organization must make the statement *Note: concrete actions of support, such as increasing aid or providing armaments, will be additionally coded with the appropriate action codes 2.3.7 Condemned a statement or official denouncement of an action by another actor; a prominent official in the government or civilian organization must make the statement *Note: concrete measures, such as sanctions, will be coded with the appropriate action codes 3 NOUNS 3.1 AttackTypes details related to an attack 3.1.1 Kidnappings taking possession of a person against his/her will 3.1.2 Mutilations purposely disfiguring a person 3.1.3 Suicide an attack during which the perpetrator purposely kills or attempts to kill him/herself 3.1.4 Sexual Assaults any attack involving sexual violence 3.1.5 Torture physical or psychological harm against a person with the goal of obtaining information or intimidating others 3.2 Weapon Types instruments and equipment involved in attacks or potential attacks 3.2.1 Aerial a military plane or helicopter and/or any armament deployed from the air 3.2.2 Heavy Arms large equipment, including tanks, missile launchers, etc. 3.2.3 Missiles any missiles/rockets/torpedoes launched from ground-based attack equipment 3.2.4 Explosive Devices any bomb, land mind, or other explosive that may be placed and detonated on-site or remotely without the assistance of heavy arms 3.2.5 Small Arms any small armaments that can be carried by one person Examples: pistols, knives, bows and arrows, machetes, blunt objects, etc. 3.2.6 Unarmed any attack using only parts of the body 3.2.7 Weapons Of Mass Destruction a weapon that can kill large numbers of humans and/or cause great damage to man- made structures or natural structures 3.2.7.1 Biological the use of disease-causing agents as a weapon 3.2.7.2 Chemical the use of toxins (non-biological) as a weapon 3.2.7.3 Nuclear the use of an explosive device using nuclear fission 3.2.7.4 Radiological the use of a device to spread radioactive material 3.3 EventTypes 3.3.1 GroupEvent 3.3.1.1 Barricade an actor-made structure (logs, burning tires, human chains, etc.) designed to restrict the movement of other actors 3.3.1.2 Demonstration when an individual or group meets publicly to manifest discontent against an individual, organization or action; the definition includes protests, marches, strikes and riots 3.3.1.3 Meeting the (usually planned) convening of two or more actors to facilitate discussion 3.3.2 AuthoritativeActions 3.3.2.1 Alert Level when the “alert level” or “readiness” level of armed forces/police/rebels changes 3.3.2.2 Curfew a temporary law that a governing body imposes, which places specific limitations on actors' movements and/or use of specific resources 3.3.2.3 Martial Law a temporary legal code imposing military rule that a governing body may impose to increase their authoritative powers during times of emergency 3.3.2.4 State of Emergency a governmental declaration that temporarily suspends certain normal functions of government, alerts citizens to alter their normal behaviors, or orders government agencies to implement emergency preparedness plans 3.3.2.5 Investigation a formal inquiry or fact finding mission 3.3.2.6 Ban the prohibition of actors to complete a task, congregate, etc. 3.4 GovernmentTypes 3.4.1 Aid goods and/or services provided to a country to help a humanitarian crisis, achieve a socioeconomic, and/or security objective; the definition includes humanitarian, military, and economic aid 3.4.2 Elections the process by which citizens vote for candidates running for political office 3.4.3 Laws legal codes/legislation enforced by a governing body 3.4.4 Taxes any fees or tariffs associated with the exchange of goods or services 3.4.5 Litigation a legal suit against an individual, organization or governing body 3.5 InfrastructureTypes 3.5.1 Electricity a power grid and corresponding supply routes to clients and local populations, including power plants and electrical grids 3.5.2 Internet computer networks and supply routes to users, including cable wires or towers 3.5.3 Road automobile transportation infrastructure, including all streets and highways systems 3.5.4 Sanitation infrastructure related to keeping municipalities clean Examples: sewer systems, recycling plants, water and wastewater treatment plants 3.5.5 Telecom a state's telecommunications systems, including satellites, phone towers, land lines, etc. *Note: communications via computers are coded as “Internet” 3.5.6 Transportation Hub any structure or area where a significant amount of local, national or international transportation occurs Examples: airports, train stations, ports, etc. 3.5.7 Water water supplies and infrastructure *Note: floods are coded as “Natural Disasters” 3.5.8 Food food, food supply and food infrastructure 3.5.9 Energy Infrastructure property, materials and resources related to the extraction, transportation and utilization of energy resources Examples: oil derricks, pipelines, LNG terminals 3.6 PeaceTypes 3.6.1 Ceasefire a suspension of active military hostilities 3.6.2 Relations the relationship between two or more states or two or more intrastate groups 3.6.3 Treaty/Agreement a formal or informal document or a verbal understanding between two or more parties 3.7 Property any physical or intangible entity that is owned by a person or jointly by a group of persons 3.7.1 Building any man-made structure used or intended for supporting or sheltering any non- residential use 3.7.2 Farmland land used to grow plants/feed livestock 3.7.3 Homes any structure used as a residence 3.7.4 Hospital a building used to provide medical care for the sick or wounded 3.7.5 Livestock any domesticated animal kept for its value as a source of food 3.7.6 Vehicle - Other a non-military mode of conveyance 3.7.7 Other Property/Possessions any other property/possessions that cannot be tagged with other property tags

The codebook is used to select tags to capture the actions as accurately as possible. Those tags and data are saved using a graphical user interface 3300, which is shown schematically in FIG. 33. The graphical user interface 3300 may be a website or series of related web pages, forms, fields, drop-down menus, buttons, links, windows, graphics, and the like, and may be part of the previously described information analysis and reporting subsystem 204. The graphical user interface 3300 could be, for example, the aforementioned Watchboard search and reporting module 316.

The first part of coding involves, from the main system interface 3302, identifying the specific emerging threat category for the information and data after it is downloaded, using an emerging threat menu 3304. For example, the emerging threat menu (e.g., a drop down list) could include “Pakistan nuclear weapons,” “Russia-Georgia conflicts,” or “Mexico political instability.” However, it is not necessary to link the threat with the nation. Thus, in another embodiment of the invention, the “threat” is separated from the “nation.” For example, the click down menu for threats may be separate from the click down menu for nation. A user would select a threat (e.g. Political Instability, Nuclear Security, Cyber Security, etc.) and then a nation (Pakistan, India, etc.) prior to entering any action code.

Next, from the interface 3302, an emerging threat location is selected using the location menu 3306, which may include a list of nations of interest, i.e., the specific country being focused on for study by a researcher that is directly associated with the selected emerging threat. For example, if open sources of media about issues concerning nuclear proliferation in Pakistan are being evaluated, then Pakistan would be selected from the location menu 3306. If no nation is identified among the list of nations of interest, one can be added.

Next, from the interface 3302, a date and/or time is selected using the event data menu 3308. The date and/or time corresponds to the data on which the action actually took place.

Also from the interface 3302, a researcher menu 3310 allows a researcher to select their name from a list or enter identifying information in a field that is then stored.

The interface 3302 provides a link or tab 3312 for generating a new event window or form 3312 a, which includes fields for entering event information, specifically an event description field 3402, event date field 3404, a researcher ID field 3406, and a comments field 3408, among others. The comments may include a brief summary of the most significant aspects of the event. A list of events 3410 is also displayed. An example new event window/form 3312 is shown in FIG. 34.

The interface 3302 includes tabs, buttons, links or other features for saving the metadata previously entered, printing, and taking other actions.

The interface 3302 provides a link or tab 3316 for generating a new article window or form 3316 a, which includes fields for entering various information and data, including a URL field 3502 (i.e., a URL associated with the downloaded information and data), and a local URL field 3504 (i.e., a URL associated with where the downloaded information and data are stored in the “Newsstand” document search and reporting interface module 308), a title field 3506. The new article information thus entered may be saved by clicking on “Save Article” button or link 3508. An example new article window/form 3316 is shown in FIG. 35.

The interface 3302 provides a link or tab 3318 for generating a new action window or form 3318 a. Here, a drop-down menu 3602 allows the selection of a nation or region of interest. Another drop-down menu 3604 allows the selection of a proper name, and a link or button 3606 if a proper name is to be added to the list. The subject causing the action to occur is entered from another drop-down menu 3608, each subject being from one of the categories: civilian, government, non-human and international (default is “Civilian”). Properties of the subject are also entered in a text boxes or entry fields 3610 in the subject window 3318. Those properties could include, for example, information related to participating, dead, wounded, and others information, as discussed above. Thus, an entry field (e.g., text box) allows one to describe the number of actors (subject) “participating” or involved in an action. Another field or text box allows one to describe the number of people (subjects) “dead”, i.e., killed during the course of an event. Another field or text box allows one to describe the number of people, who are subjects, “wounded” or otherwise injured during the course of an event. Still another field, text box, or drop-down menu 3612 allows one to add verbs, and a link or button 3614 allows for entering attributes. Another set of fields, drop-down menus, and/or text boxes 3620 could allow one to describe objects, i.e., the actor or the noun (non-human) that receives the action. Still another set of fields, drop-down menus, and/or text boxes 3622 allows one to enter the location of the action, which may be a place name (town, city), country, and/or latitude/longitude. The location information can be indicated by overlaying an indicia (e.g., pin) on a map display 3320, which is part of the interface 3300, and specifically part of the interface 3302. A link or button 3626 allows the location information to be overlaid on a map (e.g., Google Maps) to confirm the correct location has been selected. Once all of the information is entered, the action information can be saved using one of file functions 3314 or by clicking on a link or button 3616, 3624 such as the “Add” buttons shown. A “clear” or “delete” button 3618 allows one to clear all the entered data displayed in the new action interface 3318. An example new action window/form 3318 is shown in FIG. 36.

After selection of pre-determined items from one of the drop-down menus, such as menu 3612, additional drop-down menus or fields may appear. For example, after a verb is selection, then new Subject, Verb, and Object drop-down menus or fields boxes may appear within the original window.

As a general example, from an article title “Kenyan Gang Raids Rival Village” having a first sentence “Early yesterday morning, gangs in Kenya raided a nearby village, killing residents, burning homes, and stealing cattle in the process,” the coding using the codebook might look like Action 1: <Kenya> <Gangs> <Attacked> <Kenya> <Civilian>; Action 2: <Kenya> <Gangs> <Damaged> <Homes> [owned by] <Kenya> <Civilian>; and Action 3: <Kenya> <Gangs> <Seized> <Livestock> [owned by] <Kenya> <Civilian>.

The interface 3302 provides a link or tab 3322 for displaying all pending actions and pending events information previously saved, using, as discussed above, a map, list, matrix, table, graph, or other form or forms of displaying database records of event information. The pending actions and events may automatically be displayed as part of or below the aforementioned windows or forms 3312 a, 3316 a, or 3318 a.

Step 3210. Turning again to FIG. 32, the appropriate taxonomy is used to target the analysis and then forms the basis for filtering the information and data that is identified and subsequently downloaded. As discussed above, a taxonomy relates to specific concepts, such as biosurveillance or emerging threats. The taxonomy for emerging threats is slightly different than the taxonomies used in human and plant biosurveillance, as described previously.

From the taxonomy, a list of parameters, such as synonyms and keywords relevant to the concept, may be used to search for and/or filter the open source information and data. Such synonyms and keywords could include, for example in the case of emerging threats, words like “kill,” “mutilate,” “attack,” “stab,” “slash,” etc. This is similar to the parameters used in biosurveillance taxonomy, as discussed above, such as words like “infect,” “disease,” “contract,” “spread,” etc. Thus, in the case of the civil unrest concept, a taxonomy might include synonyms and keywords about public demonstrations, strikes, overthrow, attacks, assaults, cease fires, etc. In the case of the cyber-security concept, a taxonomy might include synonyms and keywords about denial of service (DOS), blocked addresses, rerouting data, data integrity, etc. In the case of the socio/economic concept, a taxonomy might include currency shifts, economic growth, social changes, food crises, water crises, etc. In the case of the infrastructure concept, a taxonomy might include water, electricity, internet, telecom, roads, sanitation, transportation, etc.

The aforementioned “Newsstand” search engine could be used to provide access to the high-quality news archives available in multiple languages, and for organizing and presenting the news in an intuitive, coded manner, as necessary for the emerging event reporting described above. By way on non-limiting examples of parameters associated with emerging threats in the civil violence domain, a taxonomy for Nuclear Security might yield the following parameters concerning Pakistan, which may also be used as search strings to identify relevant open source materials:

-   -   “Pakistan border” OR “tribal border” OR “illegal trafficking” OR         Sost OR Torkham OR Chamman OR Chaman OR Taftan OR Quetta OR         “Khojak Pass” OR “Toba Kakar Range” OR Khost OR Nushki OR Chagai         OR Dalbandin OR Nok Kundi OR Zhob OR Wana OR “Miram Shah” OR         Bannu OR “Safed Koh Range” OR Dargai OR Baidu OR Arandu OR         Chitral OR Lasht OR Noshag

Similarly, a taxonomy for Nuclear Security might generate the following parameters as that topic relates to Russia:

-   -   (A         OR “         ” OR “T         ” OR “P         ” OR “O         ” OR         * OR “         ” OR “         ” OR (         AND         ) OR (         AND         ) OR “H         -         ” OR         * OR         OR         * OR “B         ”

The interface 3302 provides a link or tab 3324 for displaying a codebook database interface, e.g., a field or text box, for adding, deleting, and modifying codes in the codebook. Additional codes may be added to the codebook when, for example, a research theme is identified. For example, as part of studying emerging threats in Mexico, a theme may emerge that involves cartels, so the codebook may be updated by adding the term “cartel” as a subject, and adding or linking the terms “kidnapped” as a verb, and “narcotics” as an object for cartel.

The interface 3302 also provides a link or tab 3326 for displaying a reporting module or interface for generating written reports concerning emerging threats. Written reports may include, for example, a fact sheet for a particular nation of interest, which could include naming of important towns, cities, provinces, states, regions; naming and providing a brief description of significant political parties and institutions, naming and providing a brief description of important figures in politics and in civil society, identifying significant pieces of legislation, issues (i.e., political, economic, social) facing the nation of interest's society, placed in concise historical background, and identifying culture-specific keywords that are used frequently in public discourse in the nation of interest.

In general, however, keywords may be general, regional, or local in nature. General keywords might include, for example, fundamental words and phrases that begin to focus a keyword search towards civil unrest, civil violence and political changes threat events. Regional keywords might include, for example, words exclusive to a geographic area that enhance a search by focusing attention on regional trends, regional/transnational issues of high importance in the target country, and regional political systems, words that are especially helpful when looking at regional and international press. Local language keywords might include, for example, keywords translated into local language, plus additional words, phrases and idioms that are language-specific, such as important place names, politician names, and oft-discussed dates (dates of independence or massacres).

Step 3212. The information and data are communicated in one or more different ways, including but not limited to, superimposing over maps to improve visualization and further analysis or action. Thus, the interface 3302 includes a link or tab 3320 for displaying a map.

The system 200 and methods of use will now be described by way of particular examples.

Example 1—Influenza 2007

The system 200 has been used retrospectively to detect, globally, social disruption evidence of influenza. FIG. 29 is a timeline of events and associated Alerts at the country level. Beginning at the top of the timeline, on Jan. 4, 2007, the first county-level report for the first influenza season in China was issued with an “Advisory” Alert value. Nine subsequent country-level reports were issued before a “Warning” Alert value was set on Jan. 21, 2007. The “Warning” was issued for China based on I&Ws evident in data obtained from various information sources 104. Those I&Ws included (1) the Guangdon Department of Health denied the existence of influenze outbreak; (2) citizens in Guangzhou were reportedly buying banlagen and other Traditional Chinese Medicine products “in an endless stream”; (3) sales at some pharmacies were up 40% and a shortage of cold medicines were reported; (4) Beijing Hospitals were said to be “deluged” and emergency supplies were being sent; and (5) “dozens” of people were being quarantined. The timeline also shows various Alerts for Argentina, New Zealand, and Australia, including Advisory, Watch, and Warning levels.

Example 2—Retrospective Analyses (Biological)

Retrospective and prospective analyses of biological events involving a variety of pathogens that were responsible for a broad range of social disruption was tested with the present invention. A retrospective case study for Rift Valley fever (RVF), Venezuelan equine encephalitis (VEE), and SARS, as well for influenza-like illness and plague was conducted (see Table 6).

TABLE 6 Disease Location Dates of the Event Influenza-like Kazakhstan February 2005 illness Influenza-like Russia February 2005 illness Plague People's Republic November 2004-December 2004 of China Plague Turkmenistan May 2004-September 2004 Rift Valley Kenya October 1997-February 1998 fever SARS People's Republic September 1802-July 2003 of China SARS People's Republic April 2004 of China SARS Taiwan December 2004 VEE Venezuela March 1995-October 1995

For each of the studies, on-site or online newspaper archives were examined, focusing on local media (online media archives were generally available after 2001). Bilingual team members evaluated articles for information on the emergence of the disease in question and on how the indigenous society responded, with emphasis on the social disruption caused by the event.

The 1995 epidemic of VEE in Venezuela and Colombia presented a complex picture of flood-induced infrastructure collapse; the presence of a disease effecting horses and humans (i.e., VEE); and possibly the presence of other diseases as well, such as dengue fever. In March and April 1995, flooding was reported in local vernacular Venezuelan media. In April 1995, equine health evaluations were reported, but there was no explicit declaration of an outbreak of disease. By June 1995, enough information was available to note the presence of a multi-focal biological event in equines, co-occurring with at least a uni-focal event in humans. In July 1995, infrastructure strain was reportedly related to equine disease (e.g., depletion of local vaccine supplies), along with indications of a multi-focal human disease also present. In August 1995, strain on the medical infrastructure was reported (e.g., hospitals overrun with infected individuals), followed by signs of social collapse in September 1995. Flooding was a key factor promoting the vigorous progression of this epidemic because of not only its effects on expansion of the vector population, but also its direct effect of disrupting multiple sectors of Venezuela's local infrastructure, such as power lines, roadways, and communication. In this example, although documentation of infrastructure collapse due to flooding appeared as early as June 1995, local reporting of social collapse due specifically to disease did not appear until September. It could easily be argued that the effects of flooding on local infrastructure greatly increased the probability of rapid loss of containment. It was later hypothesized this epidemic was due to a possible laboratory accident, highlighting the time-delays inherent in determination of attribution.

In 1802, the emergence of SARS in the People's Republic of China (PRC) appeared to be largely unnoticed by the international community. I&Ws of “unseasonal bad flu” appeared in September 1802 in local Chinese vernacular media. Diagnosis of the pathogen in question would not have been apparent beyond “bad flu”; however “unseasonal” indicated a local awareness of a potential departure from local baseline disease. In October 1802, social anxiety was reported. By November 1802, official concern was expressed regarding potential public panic. In December 1802, an abrupt decrease in reporting, indicating possible information suppression, was a key indicator of a change in local awareness of this novel threat. In January 2003, reports of supply depletions and mobilization of resources appeared, indicating severe shifts in supply and demand. By April 2003, reports documented martial law and rioting due to SARS-related social disruption. This event likely was a complex event involving a variety of respiratory pathogens; to date, there remains uncertainty as to precisely when SARS emerged within this context. In any case, reports of “unseasonal bad flu” in September 1802 and, more important, of social anxiety in October 1802 would have been key to the analyst's assessment of whether unusual disease was present. The overall pattern was one of recurrence, elevation, and diversification of the indications and warnings of a biological event declared unusual followed by reports of containment loss.

As recent history has shown, SARS was not recognized to be a transnational threat until it had translocated through the air traffic grid to eight countries including the U.S. The challenge revolved around near-real time access to transparent disease reporting, understanding of what were indications of social disruption due to containment loss, and effective analysis to determine the nature of what ultimately constituted a true transnational issue.

Example 3—Retrospective Analyses (Non-Biological)

The present system is described with regard to the Indian Ocean tsunami of Dec. 26, 2004. On December 29th, the Pentagon's Chemical-Biological Defense Directorate forwarded a request for tactical situational awareness support from onboard a U.S. Navy vessel as it steamed into Banda Aceh, Indonesia. The system 200 at the time was primarily focused on developing the social disruption I&Ws model for SARS in Guangdong Province, with all programmatic sourcing efforts devoted to that region of the world. However, because the system 200 also leveraged other international sources, the system was able to provide immediate input that a nuclear reactor in India had initiated an emergency shut-down cycle due to the regional earthquake activity. The system also reported on biological events that were reported that week; this was the first situational report issued when compared to WHO, ProMED, and the U.S. Pacific Command's situational awareness support group, the Center for Excellence in Disaster Management and Humanitarian Assistance.

Example 4—Prospective Studies

For prospective case studies, public-domain reports, including news and business reports presented on television, on the radio, over the Internet, and in newspapers, which provided comprehensive coverage of the Pacific Rim from May 1, 2005, to Jul. 15, 2005, were obtained. During this time, 50 biological events involving 13 distinct disease entities were documented; they affected humans in 15 countries and caused varying levels of social disruption (see Table 7).

TABLE 7 Disease Location Time Period Avian influenza in birds Cambodia May 2005 Avian influenza in birds Indonesia July 2005 Avian influenza in birds Vietnam July 2005 Brucellosis People's Republic of China May 2005 Dengue fever Macau May 2005 Dengue fever Indonesia May 2005 Dengue fever Burma June 2005 Dengue fever Thailand June 2005 Dengue fever Bangladesh July 2005 Dengue fever Philippines July 2005 Dengue fever Vietnam July 2005 Diarrhea Philippines May 2005 Diarrhea Bangladesh June 2005 Diarrhea Burma June 2005 Diarrhea India June 2005 Encephalitis Vietnam May 2005 Encephalitis People's Republic of China June 2005 Hemorrhagic fever People's Republic of China June 2005 Influenza-like illness People's Republic of China June 2005 Influenza-like illness Hong Kong June 2005 Influenza-like illness Nepal June 2005 Influenza-like illness New Zealand June 2005 Influenza-like illness People's Republic of China June 2005 Influenza-like illness Vietnam June 2005 Influenza-like illness Australia July 2005 Influenza-like illness Bangladesh July 2005 Influenza-like illness Cambodia July 2005 Influenza-like illness People's Republic of China July 2005 Influenza-like illness Malaysia July 2005 Malaria India June 2005 Malaria Philippines June 2005 Meningitis Thailand June 2005 Meningitis New Zealand June 2005 Meningitis People's Republic of China June 2005 Meningitis India June 2005 Plague People's Republic of China June 2005 Polio India May 2005 Polio Indonesia July 2005 Typhoid People's Republic of China June 2005 Typhoid Fiji June 2005 Typhoid People's Republic of China July 2005

Example 5—Measles

Regional baseline information for the past 10 years was examined across DPRK's neighbors, ROK, Japan, and Russia among others. The study found that measles occurs in 7 year cycles for this area of Asia and, the current outbreak, although at a decadal high, was statistically within the expected norm.

Example 6—H3N1

In addition to DPRK measles outbreak, the H3N1 translocation potential was examined for watches, warnings and advisories based on respiratory disease pattern involving H3N2 and epidemiologic data, attempting to analyze geographical and chronological relationship between H3N2 spread and media analysis. In addition, the connectivity of major U.S. cities to various cities with known H3N2 prevalence in PRC was examined (see Example 16).

Example 7—A/H5N1 and A/H7N7

The present system 200 has served as the lead tactical global event detection system for H5N1 avian influenza, and it was the first in the world to detect the expansion of H5N1 from southern China to Russia and then Eastern Europe. To date, the system 200 has filed over 12,000 reports of events possibly related to A/H5N1 influenza. Although media attention of A/H5N1 has waned in recent months, the present system 200 continues to monitor the global situation with the same level of attention as the first day the system 200 was operational.

In late 2004 and into 2005, aggressive prospective testing of the methodology for the Pacific Rim was undertaken, and resulted in detecting biological events often weeks to months before other traditional capabilities such as ProMED One such event was detection of military call-ups in North Korea later found to be due to a need for bird culling related to avian influenza control. The virus was later found to be H7N7, which had never been documented in Asia before; this finding prompted several questions as to how the pathogen entered North Korea. This information was relayed in person to the U.S. Pacific Command Surgeon General.

Example 8—Hoof and Mouth

The system 200 was the first to provide notice of undiagnosed vesicular disease in cattle in Surrey, UK, that later was diagnosed as hoof and mouth disease. Of additional interest, this event was later found to be the result of a laboratory accident, and intentional release was explored as a possible etiology but later discounted. This event and the tremendous economic damage observed during the last epidemic of the disease in the UK in 2001, highlight the advantages of the present system 200 to focus on diseases that effect human health directly, and also those that effect agriculture.

Example 9—Ebola

The system 200 was the first to report indications of the Ebola epidemic in Kasai, Democratic Republic of the Congo. This information was made available immediately to CDC, and the communications subsystem of the present invention was activated for the entire federal end users 324. CDC's collaboration in rapidly accessing ground verification information highlighted the potential reduction of the time between initial event detection to ground verification to hours and days as opposed to weeks or months. This highlights substantial improvements needed in local disease surveillance, particularly in Africa.

Example 10—Influenza

Influenza kills an estimated 250,000 to 300,000 people globally each year. The present system 200 monitors all influenza strains in support of global influenza surveillance. In a recent influenza season, the system issued nearly 3,000 event reports across 128 countries and 27 languages, which included 181 Advisories, 58 Watches, and 38 Warnings. Hundreds of reports of an H3N2 influenza virus that had possibly drifted away from the current vaccine strain of H3N2 beginning eight months previous to any action were reported in a multitude of countries. The value of that information was validated when the WHO and its partners recommended a change in the southern hemisphere influenza vaccine to include an updated H3N2 strain. This highlights the system's 200 potential contribution to event detection over the long term.

While the previous summary of the invention and illustrative examples have focused on detecting biological events through the assessment of information containing relevant I&Ws suggesting social disruption, the system 200 may equally be applicable to certain non-biological event detection. For example, the system 200 has been demonstrated to be useful in crop surveillance. The system 200 may also be sensitive to events involving nuclear, radiological, chemical, terrorist, political instability, genocide and conflict, crop surveillance, and natural disasters, among other events, as the following examples illustrate. Those events may be important to government, corporate, insurance, financial, commodities, and investment entities and markets.

I&Ws recorded in local newspapers during the pandemics of influenza in 1718, 1957, and 1968 have been evaluated. In addition, the scope of the case studies includes collection and analysis of data from seasonal influenza months during the year prior to each pandemic's appearance (i.e. 1917, 1956, and 1967). This is an attempt to establish a baseline understanding for the evolution of social disruption induced by disease and echoed in open source media. It has been hypothesized that the pandemic years will have a unique signature when compared to seasonal influenza years.

Each data point was assumed to be unique even if multiple points existed from the same date and source with the same taxonomy. In addition, all sources were assumed to be equally reliable, that analysis (associating I&Ws with articles) was consistent throughout study, and finally, that the media and perceptions (disease knowledge, medical advancement, populations) were consistent from each seasonal to pandemic year.

Statistical methods for quantifying captured qualitative data are used. Counts, the numerator alone, are not sufficient because the count of I&Ws reported will vary by the number of articles reviewed, sources available, days of the week, time of year, etc. The use of a denominator allows for calculation of a rate or proportion. The denominator is the total group that could have the characteristics being measured. Denominators considered were sources per day, articles per week, articles on influenza per day, and articles collected per day. Total articles per day (i.e. the cumulative number of articles present among the sources) are most appropriate for the denominator, because it represents the potential I&W containing articles per day.

To date, the a specific taxonomy having 114 parameters has been developed to help distinguish anomalous influenza seasons from those recorded during pandemics. The taxonomy includes the relationships (i.e. hierarchy) between parameters, as well as specific I&W examples from both seasonal and pandemic years. Finally, the taxonomy will continue to evolve as the nature of the data is increased and novel I&Ws are identified. The intent is to produce a signature which enables pattern recognition.

Example 11—Genocide

The system 200 is estimated to be useful in detecting genocide. A study to detect, retrospectively, the genocide from 1998-1999 in Kosovo was initiated (corresponding to the time Serbs launched an offensive against ethnic Albanians in the region). The study analyzed two international news sources for indicators of conflict in order to glean a basic understanding of the signal strength of conflict I&Ws in international media. This was important because a basic assumption of the system 200 is that local media sources will present a stronger signal than international news sources. Although the study was not finished, and a taxonomy of I&Ws for genocide was not created, preliminary findings indicated that the categories of indicators used in the study reflected an accurate portrayal of events preceding the outbreak of widespread violence.

Example 12—Crop Surveillance

Existing biological I&Ws were used in the creation of a unique I&W-based crop & plant taxonomy. Development of the crop pest and disease project has emphasized the necessity of a multi-disciplinary approach within the biosurveillance community. This faceted approach, termed “The One Medicine Concept”, calls on diverse disciplines to enrich the original concepts of threat assessment. This enrichment of established methodology contributes to further development of the One Medicine concept. Volumes of pests and disease, in excess of 20,000 species, as well unsophisticated crop terminology used in media sources, has made adopting the existing bio tier system used in human health insufficient to filter important events from unimportant events. New information sources 312 included farming-related media sources, agricultural workers' unions, crop societies, agriculture/horticulture universities, and crop-related NGOs. To date, 121 sources have been added in English alone. Additionally, agriculture sources have been added in French, Japanese, Korean, Italian, Vietnamese, and Farsi. Monitoring and analysis of two case studies have been conducted. These studies, a tomato epidemic in South America, and Planthoppers in Vietnam and China (see below), allowed for operational testing of the new Crop Disease I&Ws taxonomy. Current crop disease operations detail 61 reporting countries, 202 events, with 314 associated updates, using 32 languages. Crop reporting requirements consist of over 85 commonly categorized pests, diseases, and disease events, recognizing the capacity for additional pests and diseases.

Example 12-1—Rice Crop Disease

The system 200 of the present invention was used to assess rice crop disease events in a retrospective manner. Rice is the staple food of more than 65% of the people of China, and all of the people of Vietnam. In 2006, the total area planted to rice: 7.4 million hectares in Vietnam, 29.4 million hectares in China. China produced 29% of the world's rice in 2006, a total of 183 million tons. The brown planthopper (BPH), Nilaparvata lugens (Stal), vectors two virus pathogens to rice: Rice Grassy Stunt Virus (RGSV) and Rice Ragged Stunt Virus (RRSV). In 2005-2006, rice losses due to BPH were 2.77 million metric tons in China and 400,000 metric tons in Vietnam. Since 2006, rice losses due to BPH are estimated at 1 million tons in Vietnam and 4 million tons in China.

For the analysis, media and other public domain Internet sources were searched iteratively using multiple key word combinations in English, Vietnamese and Chinese. Thousands of sources were searched including media, news, university extension sites, blogs, business and financial reports following the Project Argus methodology. Over 180 Vietnamese and 580 Chinese language information sources yielded crop reports. Sources were searched for direct and indirect I&Ws of potential crop, animal and human epidemics on weekdays from February 2006 to December 2007. These information sources revealed that by Mar. 2, 2006, BPH has infested nearly 65,000 hectares in 16 provinces, 335,000 ton shortfall. By Nov. 5, 2006, BPH and RRSV had spread to 21 provinces, affecting greater than 500,000 ha rice with losses totaling an estimated US $125M. By Dec. 6, 2006, HPAI was re-introduced into Vietnam by, it is believed, smuggled birds. By Mar. 1, 2007, unvaccinated birds found dead, H5N1 positive. They had been taken from an area of HPAI outbreak to Soc Trang to feed in recently harvested rice fields. By Mar. 12, 2007, H5N1 determined to be Chinese strain. Vietnam lifted a 2005 ban on raising ducks and geese; 60 million waterfowl in the country used for reducing feed costs and pest epidemics. Over 100,000 birds were destroyed in 16 provinces; in 2006-2007, 12 infected humans, 9 fatalities in Vietnam. In summary, it was concluded that the present system could detect the interrelatedness between plant health and animal and human health.

Example 13—Nuclear Events

The system 200 was used to conduct retrospective case studies of radiological incidents in Meet Haifa, Egypt; Lilo, Ga.; Jilin, China; Tammiku, Estonia; AQ Khan; and Samut Prakarn, Thailand, to identify potential I&Ws of radiological events. These case studies encompassed time periods from 1960-2001. I&Ws derived from case studies have been used to populate a taxonomy capable of detecting nuclear and radiological events; however, to date, the system 200 has not been used prospectively to identify nuclear events.

Example 14—Biological Weapon Events

The system 200 is estimated to be useful for the detection of covert state and non-state biological weapons programs. The goal of this research has been to build a taxonomy that may be capable of alerting analysts to social disruption I&Ws suspicious for covert biological weapons research; however, to date, the system 200 has not been used prospectively to identify such events.

The former covert Dugway Biological Weapons program was manually examined retrospectively to examine four discrete events conducted by the Deseret test center in order to refine the Dugway analysis: (1) Project “Red Cloud” [Fort Greeley, Ak., November 1966-February 1987]; (2) Project “Watch Dog” [Fort Greeley, Ak., Summer 1967; (3) Project “Green Mist” [Island of Hawaii, Mar. 25-Apr. 24, 1967]; and (4) Project “Blue Tango” [Island of Hawaii, January-March, 1968]. Media articles related from surrounding areas were collected and it was determined that the data collection yielded few I&Ws for those events.

The World War II Biological Weapons deployments of Japanese Unit 731 in Congshan region of Manchura, PRC, was examined retrospectively. Multiple materials related to Unit 731 activities in Manchuria during WWII were collected. Because Unit 731 attempted multiple biowarfare attacks on Manchuria during the WWII time period, just the Ningbo plague drop in 1940 was examined. Since most of the Chinese media articles collected at the various libraries were illegible, two specific materials identified during the collection phase were used: (1) “Japanese Biological Warfare: One Family's Encounter,” by Archie Crouch; and (2) Chugokugawashiryo: Chugokushinryaku to nanasanichi butai no saikinsen, a book that compiled Chinese media articles documenting the plague outbreak during the same time period in Ningbo. Those documents were chosen because they provide two different perspectives of the plague drop: one family's personal experience versus media reporting of the event.

The materials were manually evaluated for reports of biological I&Ws, as well as for reports of the social disruption that accompanied the plague drop. In addition, both materials were examined for possible new I&Ws related to covert biological warfare programs. Finally, the materials were manually examined for possible discrepancies in reporting. That information is estimated to be useful in developing a Biological Warfare taxonomy.

The events involving the Aum Shinrykyo program in Japan, the Rajneeshee program in The Dalles, Oreg., and anthrax in Rhodesia and Namibia events have also been manually and retrospectively examined.

Example 15—Dread Threat Taxonomic Development

A dread threat study of specific biological and non-biological threats in order to develop the theoretical foundation of social disruption theory is known. Slovic et al. originally defined dread risk as the degree to which a technology or activity is perceived to be uncontrollable, dread, catastrophic, hard to prevent, fatal, inequitable, affecting future generations, not easily reduced, increasing, involuntary, and personally threatening (Slovik, et al 2000). Through the study of the 1992 cholera epidemic in Venezuela, the 1997-98 outbreak of Rift Valley fever in Kenya, the 1995 outbreak of VEE, and 1802-2003 outbreak of SARS in China, the local responses to infectious disease outbreaks was evaluated, showing particularly how a local population's sense of an emerging dread threat varies over time.

With this background, examination of other types of threats were undertaken, beginning with a detailed analysis of the Japanese attack of Ningbo, China during the Sino-Japanese War using a bomb containing plague-infested fleas. Through detailed, line-by-line analysis of Japanese Biological Warfare in China: One Family's Encounter by Archie Crouch, a new concept of “menacing context” was developed. The analysis suggests that a menacing context has emerged when a dread threat persists and requires a community to reorganize its life to help mitigate consequences of threat. This concept has applications to the study of other threats, including drought and crop failure, industrial accidents and internecine strife.

Example 16—A/H3N2 (1968)

The system 200 was used in a retrospective study to assess whether I&Ws of the A/H1N1 pandemic influenza in Hong Kong and Southern China (1968) could have been detectable prior to others reporting about the influenza. The study involved developing an I&W taxonomy of pandemic influenza, monitoring media sources for markers of social disruption indicative of anomalous respiratory disease/influenza activity, and then identify a pandemic I&W signature, which could be used to compare the seasonal year (i.e., 1968) with a prior known seasonal period (i.e., 1967).

Development of an infectious disease taxonomy included selecting highly disruptive historical biological events, collected articles from news media archives that directly or indirectly referenced the selected event, and validated the taxonomy by conducting a series of case studies. It was found that I&Ws were present in media reports prior to statements of action from the international public health community, and I&Ws increased in volume and diversified in type as the event progressed.

Development of a pandemic influenza taxonomy included collecting articles from news media archives that directly or indirectly referenced the event (e.g., 1968 pandemic influenza in Hong Kong and Southern China), identifying the societal response to disease baselines during seasonal and pandemic years, and applying the retrospective pandemic influenza taxonomy to the collected relevant articles. The resulting pandemic influenza taxonomy included 93 parameters, including direct I&Ws (one main category) and indirect I&Ws (six main categories), all of which are regional, cultural, and disease-specific. The categories are shown in Table 8 below.

TABLE 8 Main Pandemic Taxonomic Category Definition Report of Human Media reports of illness of a known or unknown Disease disease entity at a uni-focal or multi-focal level. Official Media reports of acceptance or denial of disease Acknowledgement from an official source and/or type of reporting. Community Media reports of acceptance or denial of disease Acknowledgement by a person with accreditation less than an official, but more than locals. Official Action Media reports of implementation of countermeasures, biosurveillance, health alerts, official investigations, and changes to current policies and procedures by authorities at local, national and/or international levels. Integrity of Media reports of compromise or collapse Infrastructure of a society. Local Perception of Media reports of periods of heightened anxiety Threat and public concern within a society. Business Practice Media reports of changes in the economic markets. Changes

In terms of article collection, the system 200 was applied in a target region (Hong Kong/Southern China). Approximately 120 source-months of articles from 1967-1968 were included (accessed at the Library of Congress and Georgetown University Library), including one Chinese-language source and four English-language sources. I&W Main Category Comparison. All indicators increased in 1968 compared to 1967. The greatest increases were noted in the following main taxonomic categories: report of human disease: 23-fold increase; local perception of threat: 15-fold increase; official action: 9-fold increase; individual newspaper articles were more informative in a pandemic year; pandemic year: 1-15 indicators per article; seasonal year: 1-5 indicators per article. All indicators increased at least 15-fold in July 1968 compared to July 1967. Business practice changes was the only exception. Statistically, the overall crude number of taxonomy indicators saw a 7.2-fold increase in pandemic over seasonal year (1968; n=553) versus (1967; n=77) (p=0.004), and the month of A/H3N2 emergence (July 1968) saw a 42.3-fold increase compared to July 1967 (1968; n=254) versus (1967; n=6) (p=0.008).

In summary, the system 200 was used to develop and apply a methodology that converts qualitative information into quantitative and potentially actionable information. The taxonomy-based media surveillance approach according to the present invention is viewed as enabling early detection and situational awareness of an anomalous influenza season. The number of disease indicators per article appeared to increase in the pandemic year compared to the seasonal year in Hong Kong/Southern China. A distinct time-varying signature of indicators appeared to differentiate between pandemic and seasonal influenza years in Hong Kong/Southern China. Georgetown University has applied this methodology, to some degree, using real-time electronically captured media throughout the 2009-A/H1N1 pandemic influenza, as discussed in the next example.

Example 17—A/H1N1 (2009)

The system 200 was used to identify I&Ws of the current A/H1N1 pandemic influenza and the results were compared to reports by other publicly-available media sources, such as HealthMap. Reports of respiratory disease for the 2007-2008 and 2008-2009 respiratory disease seasons were generated from relevant information sources in the form of media articles previously archived by the system 200 as previously discussed. Reports of respiratory disease, including 2009 A/H1N1, in the Spanish and English media in Mexico between Oct. 1, 2007 and May 31, 2009 were reviewed. The rate of reporting defined as a function of time for the study period utilized the ratio of respiratory reports meeting inclusion criteria to the total articles in the archive from Mexican sources. A Shapiro-Wilk test was used to assess the normality of the data. A Wilcoxon rank sum test was used to assess the statistical significance of the difference in mean rate of respiratory disease reporting and stage between seasons.

The results showed an increased prominence of respiratory disease reporting frequency in Mexican Internet sources during the 2008-2009 influenza season relative to 2007-2008. Mean rate of reporting: Oct. 1, 2007 and May 31, 2008 (rate=2.19); Oct. 1, 2008 and May 31, 2009 (rate=4.08) (p<0.0001). Stage of reporting: Oct. 1, 2007 and May 31, 2008 (stage 2 and occasionally stage 3; mean stage=2.35); Oct. 1, 2008 and May 31, 2009 (staged at 2 or less; mean stage=1.80) (p-value <0.0001).

Based on these results, the system 200 found an increased prominence of respiratory disease reporting frequency in Mexican Internet sources during the 2008-2009 influenza season relative to 2007-2008. The stage of respiratory disease reports in the media in the 2008-09 season scored significantly higher in terms of social disruption than reports in the media in the 2007-08 season. The respiratory disease was prevalent in parts of Mexico, and reported as unusual, much earlier than the microbiological recognition of influenza A/H1N1 in late April, 2009. Increased reporting frequency, longer duration of the respiratory season and significantly increased stage (social disruption) together may indicate an anomalous season. Through the analysis of data over time, media reporting baselines can be generated and media signals of anomalous activity identified by the system and method of the present invention, providing clues to emerging outbreaks. The present event-based surveillance tool is complementary to traditional public health surveillance methods utilized to identify an outbreak and track its progression and can cue public health professionals to look for an emerging or changing pathogen earlier than would otherwise be known, and it can provide an effective means for monitoring the spread of disease and severity in a population or region.

Example 18—Emerging Threats (Pakistan)

The system 200 has been used to perform case studies before going operational: Kenya (July 2007-January 2008); Pakistan (January-July 2007; October 2009-March 2010); Philippines (November 2005-January 2006); East Timor (April-July 2006); Guinea (January-February 2007); Georgia/South Ossetia (January-August 2008); and Zimbabwe (June 2007-January 2008), where the dates in parentheses reflect the time parameters studied for the specific country.

In the case of Pakistan, local language electronic media were used in a retrospective analysis to track and trend patterns of political instability, focusing on the intensity of violence involving the Pakistani army and the Taliban. Open source emerging event information and data were limited to attacks by either the Pakistani army or the Taliban-driven insurgency from Oct. 11, 2009, to Mar. 6, 2010. The primary sources were local level papers available electronically (more than 75% of sources are local or regional sources, including more than 100 unique sources in multiple languages). The research for Pakistan was retrospective; however, the data collection involved contemporaneous searches, article collection and coding. The analysts would progress day-by-day as if they were operating in a real-time environment. All events involving a direct, violent attack by either the Taliban or by the military, irrespective of the outcome of such an attack, were recorded. Articles were selected through an automatic process and then provided to analysts to code the information and data using, for example, the interface 3302 discussed above. Articles were coded as actions and events.

The event data suggested that despite military operations in northwestern Pakistan, the level of militant-initiated violence remained relatively consistent over time. Both state and civilian targets, with the exception of one two week time period in December, were attacked at similar levels by the Taliban across Pakistan. While some significant successes was gained by the Pakistani state against the Taliban—including the recapturing of great swathes of formerly insurgent-controlled territory—the militants retained a capacity to launch attacks across Pakistan. The military's offensive on the Taliban remained at high levels through October and November, before decreasing at the end of 2009. On December 12, Prime Minister Yusuf Gilani announced that the military operations in South Waziristan were over. While military actions against the Taliban in South Waziristan continued and a formal end to the military operations did not occur until the end of March, overall military activity in South Waziristan did reduce substantially. The decrease in military attacks may have contributed to the initial spike in Taliban-initiated attacks in late December.

The results are indicative that the surveillance of electronic media in the local language in a near-real time process combined with a rigorous coding taxonomy can provide meaningful insight into the tracking and trending of civil violence and political instability in a complex environment such as Pakistan.

Example 19—Emerging Threats (South Ossetia)

In another emerging threat analysis—the lead up to the South Ossetian War (Jan. 1, 2008-Aug. 8, 2008)—the present invention was used to perform a retrospective analysis of civil violence to evaluate whether I&Ws in media sources could recognize destabilizing events. The South Ossetian War event was selected to test the bounds of the ‘civil violence’ taxonomy, and because it involved an interstate conflict, breakaway regions, unstable neighbors (Russia's North Caucasus Republics), and domestic political instability following a contentious Presidential election. The study was technically a retrospective analysis, but simulated “day-by-day” contemporaneous searches. The data set included 2,583 codes and cited 2,465 articles from 1,328 open sources (75% of which were local media sources, not international sources). Only Russian-language articles were used. Other regional source provided a more balanced perspective. Twenty-seven sources in Georgia as well as others in Ukraine, Azerbaijan, Israel, etc., were included.

The coding system allowed for the ability to identify public relations “statements” by various actors. During the period January 1 to May 30, and average of five statements per week were identified. During the period May 31 to July 11, an average of 16 statements per week were identified. A large spike of 46 statements were identified during the week of Jul. 5, 2008. It was found that public relations was a component of Russian and Georgian foreign policy. Hiring ‘Western’ public relations firms, Russia Today, etc. It was reported that about 50 Russian journalists arrived in Tskhinvali by Aug. 5, 2008. Paul Goble noted that, “both Russian and Georgian leaders were convinced that the way in which the media treated the war was just as important as what took place on the battlefield in determining the winners and the losers.” The spike during the week of July 5th just preceded the “Kavkaz 2008” military exercise viewed as a “dress rehearsal” for the war. The statements subsided during “Kavkaz 2008,” but then rose dramatically once the exercises were complete and after troops were re-positioned in North Ossetia.

From late May to early July, 2008, violence occurred in Abkhazia and South Ossetia in a fairly even manner. Following a lull in violence, which also coincided with the “Kavkaz 2008” military exercise, notable events predominantly occurred in South Ossetia as opposed to Abkhazia. During the week of August 1st, 141 individual actions were identified: 63 actions (or 45%) occurred in South Ossetia; eight actions (or 5.6%) occurred in North Ossetia; five actions (or 3.5%) occurred in Abkhazia; the remaining actions took place in other locations that were engaged in the conflict in some capacity.

Prior to Aug. 1, 2008, there was a fairly even dispersal of actions between Abkhazia and South Ossetia and numerous reports of actions in and near both regions. The shift indicated that South Ossetia was the more likely location for the outbreak of war at the beginning of August 2008. A dip in attacks coincided with the active phase of the “Kavkaz 2008” military exercise.

From January 1 to February 15, there were no reports of meetings involving Abkhazian/South Ossetian authorities. From Jun. 10, 2008, to Aug. 8, 2008, there were three distinct spikes in meetings: week of June 10th (15 meetings), week of July 8th (9 meetings), and week of July 29th (6 meetings). A stark contrast between the beginning of 2008 and the months just prior to the start of the war demonstrates that open source media reports can provide some measure of increased coordination and cooperation between relevant actors and highlights the utility of hyper-local sources. The case-study results suggest that the application of near-real time event data monitoring could have enabled the prospective early detection and warning of the potential for large-scale civil violence.

Example 20—Emerging Threats (Kenya)

The system and method of the present invention were used in a case-study analyzing the indicators and warnings for civil violence and political instability in the months prior to and after (May 1, 2007-Jan. 31, 2008) the Kenyan presidential and parliamentary elections on Dec. 27, 2007.

The data compiled for this study came from 70 distinct Kenyan, diasporic and regional sources; 976 discrete incidents of interest were recorded. That dataset showed a rising vulnerability in areas before there were outbreaks of widespread violence, based on pre-election trends showing escalation in indicators and warnings. The data showed the potential for increased violence in two of Kenya's eight regions and the decreased proclivity to violence in four out of Kenya's eight regions. A review of all indicators and warnings recorded during the period of May 1, 2007 through Dec. 27, 2007 (Election Day) shows two distinct periods of rising indicators and warnings: early May through mid-August and early/mid-October until Election Day. For the former period, these events were mostly concentrated around land violence in Western Kenya, especially the Mount Elgon region, as well as the police crackdown against Mungiki in Nairobi's slums. The latter period recorded instances of politically-related indicators and warnings, especially the trading of accusations by political parties and politically-related violence. The data showed that Rift Valley and Nairobi Provinces would be the regions with the greatest possibility of falling into deadly conflict following the 2007 elections. While the varied ethnic makeup of the Rift Valley indicates predisposition to violence in Kenya, it merits being stated that past violence has not always been as disproportionately skewed to the Rift Valley as it was in 2007. In other words, the dataset offered hard, empirical evidence as to increased vulnerability to violence in the Rift Valley, instead of just assuming predisposition to violence because of the province's ethnic make-up and conflict history. The post-election outcome bore out the reality of what the system had indicated in these two provinces. Similarly, the data showed that Central, Eastern, Northeastern, and Coast Provinces would all have reduced or lower levels of violence compared with the Rift Valley and Nairobi.

The authors of this case-study found that by tracing the outbreak of local-level emerging threats in Kenyan media as early as two to three months before the election, one could observe the increase in the vulnerability of Kenya to large-scale violence. The surveillance of open source local language media in Kenya demonstrated that contrary to depictions in the international media, this conflict was not an unexpected or unforeseen event. Through the recording of local-level emerging threats in Kenyan media as early as several months before the election, one could observe an increase in Kenya's vulnerability to large-scale political and ethnic violence. In contrast, international media and analytic sources failed to accurately anticipate or identify the clear indicators and warnings of violence prior to the December 2007 elections.

Although certain presently preferred embodiments of the disclosed invention have been specifically described herein, it will be apparent to those skilled in the art to which the invention pertains that variations and modifications of the various embodiments shown and described herein may be made without departing from the spirit and scope of the invention. Accordingly, it is intended that the invention be limited only to the extent required by the appended claims, prior art, and applicable rules of law. 

We claim:
 1. A computer-aided system for detecting and communicating information regarding a socially disruptive event having an event type, the system comprising: non-transitory computer readable storage media that stores: indicators associated with a plurality of event types, the indicators indicative of social disruption; and keywords and synonyms associated with the plurality of event types, the plurality of event types comprising at least one of emerging threats, biological events, civil unrest, cybersecurity events, socioeconomic events, events impacting food or water security, or infrastructure events; an information collection and processing subsystem that identifies and downloads publicly available documents relevant to the socially disruptive event by searching open source electronic media sources for the indicators associated with the event type of the socially disruptive event, the socially disruptive event comprising at least one of an emerging threat, a biological event involving humans, a biological event involving animals, a biological event involving plants, a political event, a socioeconomic event, an economic event, an industrial event, an infrastructure event, an environmental event, civil unrest, dislocation, riots, violence against property, violence against people, a cybersecurity event, events impacting food or water security, proliferation of weapons of mass destruction, improvised explosive devices, human trafficking, narcotics, public opinion about a political topic, public opinion about public policy, public opinion about religion, public opinion about entertainment, a natural disaster, natural disaster aftermath, natural resource exploitation, or military activity; a graphical user interface that provides functionality for a user to identify one or more of the stored synonyms and keywords; and an information analysis and reporting subsystem that: searches or filters the downloaded documents using the synonyms and keywords identified by the user; and generates and outputs event reports including information included in the downloaded documents.
 2. The system according to claim 1, wherein the graphical user interface further provides functionality to edit the keywords and synonyms associated with the plurality of event types.
 3. The system according to claim 1, wherein the information analysis and reporting subsystem automatically ranks or clusters the information included in the downloaded documents using one or more machine learning models.
 4. The system according to claim 1, wherein the graphical user interface further provides functionality for the user to rank or cluster the information included in the downloaded documents.
 5. The system according to claim 1, wherein the graphical user interface is adapted to display an event stage table containing scale values for assigning a degree of severity to the socially disruptive event.
 6. The system according to claim 1, wherein the graphical user interface further provides functionality for the user to identify a specific emerging threat from a predefined list of emerging threats.
 7. The system according to claim 1, wherein the graphical user interface further provides functionality for the user to identify an event location from a predefined list of nations of interest.
 8. The system according to claim 1, wherein the graphical user interface further provides functionality for the user to identify a date and/or time that the socially disruptive event took place.
 9. The system according to claim 1, wherein the indicators indicative of social disruption comprise at least one of reports of human, animal, or plant disease, official acceptance or denial, official action, demand for medical services, reports of local perception of threat, business practice changes, compromise or collapse of integrity of infrastructure, temperature changes, perception changes, or suggestion that individuals or organizations have begun changing their daily routine activities.
 10. The system according to claim 1, wherein social disruption comprises alteration in the normal functioning of a social system that, under extreme conditions, can lead to the collapse of social infrastructure.
 11. The system according to claim 1, wherein the publicly available documents include at least one of a hypertext markup language (HTML) document, an extensible markup language (XML) document, a portable document format (PDF) document, text, an image, an audio file, or a file created from a radio frequency signal intercepted through the air, sea, or a space platform.
 12. The system according to claim 1, wherein the publicly available documents include at least one of words, numbers, indicia, quotes, excerpts, images, records, code, or scripts.
 13. The system according to claim 1, wherein the open source electronic media sources are web servers publicly accessible via the Internet.
 14. A computer-aided method for detecting and communicating information regarding a socially disruptive event having an event type, the method comprising: storing indicators associated with a plurality of event types, the indicators indicative of social disruption; storing keywords and synonyms associated with the plurality of event types, the plurality of event types comprising at least one of emerging threats, biological events, civil unrest, cybersecurity events, socioeconomic events, events impacting food or water security, or infrastructure events; identifying and downloading publicly available documents relevant to the socially disruptive event by searching open source electronic media sources for the indicators associated with the event type of the socially disruptive event, the socially disruptive event comprising at least one of an emerging threat, a biological event involving humans, a biological event involving animals, a biological event involving plants, a political event, a socioeconomic event, an economic event, an industrial event, an infrastructure event, an environmental event, civil unrest, dislocation, riots, violence against property, violence against people, a cybersecurity event, events impacting food or water security, proliferation of weapons of mass destruction, improvised explosive devices, human trafficking, narcotics, public opinion about a political topic, public opinion about public policy, public opinion about religion, public opinion about entertainment, a natural disaster, natural disaster aftermath, natural resource exploitation, or military activity; providing a user interface that provides functionality for a user to identify one or more of the stored synonyms and keywords; searching or filtering the downloaded documents using the synonyms and keywords identified by the user; and generating and outputting event reports including information included in the downloaded documents.
 15. A computer-aided system for detecting and communicating information regarding a socially disruptive event having an event type, the system comprising: non-transitory computer readable storage media that stores: indicators associated with a plurality of event types, the indicators indicative of social disruption; and a predefined list of subjects, a predefined list of objects, and a predefined list of acts specially crafted to characterize events of the event type; an information collection and processing subsystem that identifies and downloads publicly available documents relevant to the socially disruptive event by searching open source electronic media sources for the indicators associated with the event type of the socially disruptive event, the socially disruptive event comprising at least one of an emerging threat, a biological event involving humans, a biological event involving animals, a biological event involving plants, a political event, a socioeconomic event, an economic event, an industrial event, an infrastructure event, an environmental event, civil unrest, dislocation, riots, violence against property, violence against people, a cybersecurity event, events impacting food or water security, proliferation of weapons of mass destruction, improvised explosive devices, human trafficking, narcotics, public opinion about a political topic, public opinion about public policy, public opinion about religion, public opinion about entertainment, a natural disaster, natural disaster aftermath, natural resource exploitation, or military activity; and an information analysis and reporting subsystem that provides a graphical user interface that provides functionality for a user to: characterize an action described in each of the downloaded documents by selecting a subject from the predefined list of subjects, selecting an object from the predefined list of objects, and selecting an act being performed by the subject on the object from the predefined list of acts; and generate and output event reports characterizing the actions described in the downloaded documents.
 16. The system according to claim 15, wherein the graphical user interface further provides functionality to edit the predefined list of subjects, the predefined list of objects, or the predefined list of acts.
 17. The system according to claim 15, wherein the information analysis and reporting subsystem automatically ranks or clusters information included in the downloaded documents using one or more machine learning models.
 18. The system according to claim 15, wherein the graphical user interface further provides functionality for the user to rank or cluster information included in the downloaded documents.
 19. The system according to claim 15, wherein the graphical user interface is adapted to display an event stage table containing scale values for assigning a degree of severity to the socially disruptive event.
 20. The system according to claim 15, wherein the graphical user interface further provides functionality for the user to identify a specific emerging threat from a predefined list of emerging threats.
 21. The system according to claim 15, wherein the graphical user interface further provides functionality for the user to identify an event location from a predefined list of nations of interest.
 22. The system according to claim 15, wherein the graphical user interface further provides functionality for the user to identify a date and/or time that the socially disruptive event took place.
 23. The system according to claim 15, wherein the indicators indicative of social disruption comprise at least one of reports of human, animal, or plant disease, official acceptance or denial, official action, demand for medical services, reports of local perception of threat, business practice changes, compromise or collapse of integrity of infrastructure, temperature changes, perception changes, or suggestion that individuals or organizations have begun changing their daily routine activities.
 24. The system according to claim 15, wherein social disruption comprises alteration in the normal functioning of a social system that, under extreme conditions, can lead to the collapse of social infrastructure.
 25. The system according to claim 15, wherein the publicly available documents include at least one of a hypertext markup language (HTML) document, an extensible markup language (XML) document, a portable document format (PDF) document, text, an image, an audio file, or a file created from a radio frequency signal intercepted through the air, sea, or a space platform.
 26. The system according to claim 15, wherein the publicly available documents include at least one of words, numbers, indicia, quotes, excerpts, images, records, code, or scripts.
 27. The system according to claim 15, wherein the open source electronic media sources are web servers publicly accessible via the Internet.
 28. A computer-aided method for detecting and communicating information regarding a socially disruptive event having an event type, the method comprising: storing indicators associated with a plurality of event types, the indicators indicative of social disruption; storing a predefined list of subjects, a predefined list of objects, and a predefined list of acts specially crafted to characterize events of the event type; identifying and downloading publicly available documents relevant to the socially disruptive event by searching open source electronic media sources for the indicators associated with the event type of the socially disruptive event, the socially disruptive event comprising at least one of an emerging threat, a biological event involving humans, a biological event involving animals, a biological event involving plants, a political event, a socioeconomic event, an economic event, an industrial event, an infrastructure event, an environmental event, civil unrest, dislocation, riots, violence against property, violence against people, a cybersecurity event, events impacting food and water security, proliferation of weapons of mass destruction, improvised explosive devices, human trafficking, narcotics, public opinion about a political topic, public opinion about public policy, public opinion about religion, public opinion about entertainment, a natural disaster, natural disaster aftermath, natural resource exploitation, or military activity; providing a user interface that provides functionality for a user to: characterize an action in each of the downloaded documents by selecting a subject from a predefined list of subjects, selecting an object from a predefined list of objects, and selecting an act being performed by the subject on the object from a predefined list of acts, wherein the predefined list of subjects; and generate and output event reports characterizing the actions described in the downloaded documents. 